Kyuyeon Park — AI Agent Engineer

01 — Work

Experience

Sendbird

AI Agent Eng Intern

Dec 2025
— Mar 2026

Shipped a production CS agent for Korea's largest food delivery platform and replaced 9-hour manual QA with a 5-minute automated evaluation pipeline.

Diagnosed tool routing failures in a live system, consolidated proxy architecture from 2 to 1 tool call, and iteratively refined prompts via batch testing — pushing CSAT to 3.4/5 at 5% regional rollout (200–250 conversations/day). Built an LLM-as-a-judge evaluator scaling to 1,000+ conversations. Shipped a Slack/Jira/GitHub issue tracker adopted team-wide with zero onboarding.

Prompt EngineeringLLM EvalToolingDjangoProduction AI

BCG

Research Analyst

Oct 2025
— Dec 2025

Found the real bug — RDB-to-VectorDB misrouting — that the team assumed was a prompt problem.

Built a ReAct-based multi-agent LLM system for enterprise sales data consolidation. Traced end-to-end data flow across agent planning, tool routing, and DB layers. Shipped a domain-specific typo correction module at 95%+ accuracy, resolving 9 business-critical retrieval failures.

LangGraphRAGMulti-agentPostgreSQL

LG AI Research

AI Data Eng Intern

Dec 2023
— Dec 2024

Rebuilt a monolithic crawl pipeline into a distributed system and cut cloud storage costs by 70%.

Re-architected a 100M+ page Korean web crawling pipeline into a distributed K8s/Redis system with memory buffering and compression. Designed a 2-step vector search tool planning system, filtering a 120k QA dataset to 20k high-quality samples (83% reduction) to replace LLM fine-tuning.

KubernetesRedisAirflowBigQueryVector DB

03 — Work

Selected Projects

01

Production CS AI Agent

End-to-end AI customer support agent for Korea's major food delivery platform. Replaced a GPT-4.1 legacy system with Claude Haiku + extended thinking. Diagnosed proxy bottlenecks and shipped targeted architecture fixes.

+21%

CSAT improvement (2.9 → 3.5)

02

LLM-as-a-Judge Eval System

Decision-tree path conformance evaluator with a sliding window pointer mechanism. Replaced manual QA across 1,000+ conversations. Proactively transferred to the Tooling team after building a high-quality prototype.

9hr → 5min

Evaluation review time

03

Web Crawl Pipeline @ Scale

Large-scale Korean web crawling pipeline feeding LLM pre-training data. Re-architected monolithic WARC builder into a distributed K8s/Redis system with memory buffering and compression.

70%

Cloud storage cost reduction

04

Issue Tracker Automation

Go-based internal tool integrating Slack, Jira, and GitHub. Auto-generates tickets from Slack threads, links PRs, syncs states bidirectionally, and summarizes root causes via LLM.

Day 1

Team adoption, zero onboarding

04 — Process

How I Work

I don't just use AI tools — I build with them as teammates.

My agents have names, roles, and autonomy. Dev-Gorani handles code, diagnoses bugs end-to-end, and commits fixes directly. I describe the problem in natural language; it ships the diff.

Orchestration Discord × Codex CLI × Claude-code

Agents coding gorani × strong gorani and more

// live session — Coding Gorani carrying out one of my requests

05 — Toolkit

Skills & Tools

Languages

Python
SQL / NoSQL
TypeScript
Java

AI / ML

AI Agent Development
RAG
Prompt Engineering
LangChain / LangGraph
LLM Evaluation
Claude Code
Codex

Data Engineering

Airflow
Redis
BigQuery
PostgreSQL / DynamoDB
Apache Superset

Infrastructure

Kubernetes
Docker
n8n
AWS
GCP

Kyuyeon Park Builds.

Kyuyeon
Park
Builds.