Skip to content

Data Platform Engineer · SF Bay Area

Jordan Lewis

I take data platforms from proof of concept to production.

Right now that's a data lakehouse on Kubernetes: one query engine over 30+ data sources that powers a customer-facing product and about a dozen internal teams. Plus a Kafka streaming platform, AI tooling, and the on-call and disaster-recovery that keep it running.

Jordan Lewis

Across the data lifecycle

  1. Data modeling & ETL
  2. Lakehouse & streaming platform
  3. Semantic layer
  4. Data products & APIs
  5. Analytics & BI

The platform I run, by the numbers

99.9%
Lakehouse uptime
10+
Teams on the platform
30+
Data sources, one engine
8
Services on Kubernetes

What I do

Across the platform

I work across the stack that makes analytics possible: storage, query engines, streaming, identity, and governance.

Data lakehouse platforms

A single place to query the company's data wherever it lives, on-prem or in the cloud, without copying it somewhere first. I run it on Dremio and an Apache Iceberg catalog: one query engine over more than 30 data sources. It backs a customer-facing product and the internal teams that build on it.

Reliability & operations

Keeping a service people depend on running and trustworthy: automated deploys, on-call and incident response, a disaster-recovery plan, and the dashboards that catch trouble early.

Streaming & AI tooling

Moving data the moment it's created instead of waiting for a nightly batch, with a Kafka streaming platform whose events land straight in the lakehouse. On top, AI tooling that documents the data catalog and answers questions about it from your code editor.

Selected work

Things I've built

Writing

Recent posts

Let's talk.

Happy to talk data platforms, lakehouses, or where AI actually earns its keep in infrastructure. LinkedIn is the fastest way to reach me.