Flagship platform
Production data lakehouse on Dremio + Kubernetes
A production-critical service at 99.9% uptime that a dozen teams across the company query directly.
Data Platform Engineer · SF Bay Area
I own the data stack, end to end.
I took a Dremio lakehouse from a single-VM proof of concept to a production-critical service at three nines: one SQL engine over 30+ sources across six backend types, used by about a dozen teams. Plus a Kafka platform, an LLM and MCP layer, and the on-call and HA/DR that keep it up.

Across the data lifecycle
What I do
I work across the whole stack that makes analytics possible: storage, query engines, streaming, identity, and governance.
I run Dremio and an Apache Iceberg catalog on Kubernetes. One SQL engine queries HDFS, Hive, object storage, Snowflake, Postgres, and Mongo together, with a semantic layer, SSO, and query acceleration on top.
I own the platform end to end. Multi-environment CI/CD, on-call, incident response, HA/DR planning, and the dashboards that keep a production service at three nines.
Kafka on Confluent for Kubernetes, with topics landing in the lakehouse as Iceberg tables. On top of it I build LLM and MCP tooling that documents a catalog and answers questions about it from your editor.
Selected work
Flagship platform
A production-critical service at 99.9% uptime that a dozen teams across the company query directly.
Streaming
Self-hosted streaming with end-to-end auth, running in staging. Topics land in the lakehouse as Iceberg tables.
AI / Platform
Documents a data catalog on a daily run and answers catalog questions from an engineer's editor.
Writing
Happy to talk data platforms, lakehouses, or where AI actually earns its keep in infrastructure. LinkedIn is the fastest way to reach me.