LLM / Agent Platforms
- A2A framework design
- Multi-hop reasoning agents
- RAG
- Vertex AI / Azure OpenAI
AI / Cloud / Distributed Systems
Staff-level Software Engineer / AI Platform & Cloud Architect
Building production LLM platforms, edge AI systems, and distributed data infrastructure from prototype to enterprise deployment.
Profile Focus
About
I am a senior software engineer with 10+ years building production AI, computer-vision, and distributed systems end-to-end, from low-latency C++ edge inference engines to cloud-native LLM platforms.
My work sits at the intersection of AI platform architecture, RAG systems, edge-cloud deployment, and large-scale data pipelines on GCP and AWS, with repeated ownership from prototype through commercialized enterprise deployment.
Expertise
Experience
Jan 2023 — Present
Acer — Advanced Tech BU
Lead Architect for the cloud-native AI Agent Platform on GCP. Designed a modular Agent-to-Agent (A2A) framework with dynamic runtime loading, multi-agent orchestration, real-time ASR, and large-scale RAG retrieval.
Jun 2019 — Jan 2023
Acer — Advanced Tech BU
Owned end-to-end engineering of a commercial edge-to-cloud AI platform deployed across retail and transportation. Built a real-time C++ inference engine with Cython/Python integration achieving sub-second latency in production.
Feb 2017 — Jun 2019
Acer — Advanced Tech BU
Simulation-driven model development. Built a virtual-to-physical feedback loop pairing GTA-V environments with real-time shared-memory inference for autonomous-driving perception and control.
Selected Work
01
A2A framework / RAG / multi-LLM routing
Architected a cloud-native AI platform on GCP with dynamic runtime agent loading, multi-agent orchestration, and multi-LLM routing across GPT-4 Turbo, Claude 3.5 Sonnet, and Gemini 2.0 Flash.
02
LLM workflow automation
Delivered a multilingual technical-translation agent with a four-stage verification pipeline covering exact match, AI review, generalization, and human review.
03
Commercial edge-to-cloud computer vision
Led architecture and engineering of a commercial n:n face-recognition platform with real-time C++ inference, edge hardware optimization, and cloud verification services.
04
Spark / Hadoop analytics infrastructure
Re-architected global telemetry ingestion and preprocessing systems using Spark and Hadoop MapReduce for device analytics and OTA workflows.
Career Highlights
10+ years in software engineering
Built AI products from prototype to commercial deployment
Validated RAG quality with 3,131 human-verified QA pairs
Delivered 29-language translation automation at 99.9% accuracy
Achieved 97.24% MegaFace accuracy for edge face recognition
Reduced ETL runtime from 24 hours to 2 hours
Processed about 50M-90M telemetry CSV rows per day