San Jose, CA • Open to H1B Transfer

Daming Wu

Senior Software Engineer

Shipping AI-powered products end-to-end — from an ML-driven real estate analytics SaaS, to a crisis-aware mental-health agent with a paper under review at AIES 2026, to a self-hosted novel-to-video animation pipeline. Backed by 6+ years architecting microservices, distributed systems, and full-stack platforms in production.

6+
Years Engineering
5M+
Daily Requests Served
AIES 2026
Paper Under Review

Core Skills

☁️

Cloud & Infrastructure

AWS Azure GCP (Cloud Run / SQL) Cloudflare Workers Docker Kubernetes Terraform
🔧

Application Stack

Python Go TypeScript FastAPI React Next.js
🤖

AI & Machine Learning

Claude / Anthropic SDK Multi-agent PyTorch XGBoost Whisper LangChain
💾

Data & Databases

MySQL Redis Snowflake Kafka Airflow BigQuery
🔐

DevOps & Monitoring

Jenkins GitLab CI/CD Grafana Prometheus ELK Stack Splunk

System Design

Microservices REST/gRPC Event-driven Scalability Security

Featured Projects

01

Nestlyze — Real Estate Analytics SaaS

Production U.S. real estate analysis platform. Trained a gradient-boosted AVM that hits 18.6% MAPE in NYC and 15.2% MAPE in Connecticut on holdout data, paired with a 6-agent Claude-powered analysis pipeline (school / commute / climate / cost trajectory / neighborhood / valuation). Full GA4 + Search Console funnel instrumentation; deployed on GCP Cloud Run + Cloud SQL behind Cloudflare.

React FastAPI Claude XGBoost Cloud Run + SQL Cloudflare
🏡
02

Stay — Crisis-Aware Mental Health AI

Open-source mental-health companion built on Next.js + Claude with an explicit crisis-detection layer (988 / Crisis Text Line / DV / Childhelp bridging) and a clinician-reviewed safety protocol. First-author on two papers submitted to AIES 2026 on safety-critical conversational AI; informal clinician review completed; prompt + skill distributed under a custom safety license.

Next.js TypeScript Claude Skill SDK Safety eval
🫂
03

知几 / Mystic Lens — AI Divination App

Full-stack consumer AI app with an 8-agent reasoning pipeline and a novel /ritual gesture flow that uses the device camera for an interactive divination experience. Migrated the production stack off Render onto GCP Cloud Run + Cloud SQL fronted by a Cloudflare Worker, with a pre-compute strategy that cut repeat-reading inference cost by ~83%.

React FastAPI Claude Cloud Run Cloudflare Worker
🔮
04

Video Repurpose Agent

End-to-end automation that pulls YouTube videos, transcribes & dubs them into Chinese with cloned voices, and uploads to multiple Chinese platforms. Operates an 18-channel YouTube matrix on automated systemd timers (08 / 14 / 20 uploads + 09 Telegram daily report), with multi-account session management and quota-aware scheduling.

Python Whisper CosyVoice FFmpeg systemd Telegram Bot API
🎬
05

Wuxia Donghua — Novel→Video Pipeline

Self-hosted AI animation pipeline turning Chinese wuxia novels into animated short dramas. Wires together SkyReels-V2 for video, CosyVoice for voice cloning, and Kling for shot-level retries — all running locally on a single RTX 5090 with a custom GPU broker (MCP) for VRAM reservation across concurrent jobs.

PyTorch SkyReels-V2 CosyVoice CUDA MCP server
🐲

Work Experience

Sep 2022 - Present

Senior Software Engineer

Fargo Automotive of Gainesville
  • Architected microservices on AWS/Azure achieving 99.9% uptime
  • Designed scalable REST/gRPC APIs increasing throughput by 45%
  • Built LLM and PyTorch-powered services reducing manual time by 90%
  • Integrated Plaid, Stripe, Twilio APIs streamlining onboarding by 25%
  • Established observability stack reducing MTTR by 40%
  • Led 6-member team adopting TDD with 80% coverage
Oct 2020 - Sep 2022

Software Engineer

Fargo Automotive of Gainesville
  • Deployed AI-based predictive maintenance using AWS Lambda and PyTorch
  • Implemented automated training pipelines improving accuracy by 25%
  • Reduced API latency from 1.2s to 300ms
  • Built evaluation dashboard tracking model performance
  • Improved CRM API response time by 40%

Let's Connect

Open to opportunities in cloud architecture, distributed systems, and AI/ML engineering.

(352) 278-8384