About & résumé

A bit more about me

I'm a data platform engineer on a data mesh team at a marketing technology company. The work sits between data engineering, platform engineering, and distributed systems. Right now that means a Dremio lakehouse, a Kafka platform on Confluent for Kubernetes, and an Iceberg catalog that pulls on-prem and cloud data into one place.

Most of the job is figuring out how the pieces fit, and why they behave the way they do when they break. I'd rather own a whole system than ship a feature and walk away: the upgrades, the on-call, the unglamorous parts that decide whether anyone trusts it. Outside work I'm usually on a mountain bike somewhere around the Bay Area.

Download PDF

Recognition

How it's landed

Earned the top annual performance rating in 2023 ("sets a new standard," the highest on the scale), plus an internal innovation-award nomination.
Rated "exceeds expectations" in 2024 for taking the lakehouse to production and holding it above three nines.
The person the team hands complex, cross-org infrastructure work to, and the technical liaison to DevOps, networking, and cloud.

Trajectory

How I got here

2019
B.S. and a first data job
Finished a B.S. in Computer Science at Arizona State and started at USAA as a data engineer in Plano, Texas.
2019 – 2022
USAA: data engineering
Built ETL pipelines and runtime data-integrity controls in Python, IBM DataStage, and Airflow across Snowflake, Oracle, and Netezza. Led the first team in the org to move business-built models onto IT-supported infrastructure: 60% less code, 95% less monthly effort, about $84K saved per model a year. Tech-led a team of five toward the end.
2020 – 2024
M.S. at night, full time by day
Started an M.S. in Computer Science at Georgia Tech in 2020 and finished in 2024, taking classes at night across both jobs while working full time.
2022
Joined the data mesh team
Moved to RR Donnelley to help build a new internal data platform from the ground up.
2022 – 2024
Cataloging, APIs, and the internal portal
Built the cataloging and lineage backbone (crawlers that load thousands of assets into the governance catalog), a data-product API layer, and the team's internal web portal.
2024
Lakehouse, from scratch
Stood up the first Dremio lakehouse on Kubernetes with an Iceberg catalog and object storage. Wired in Kerberized HDFS and federated sources, and built the first Prometheus and Grafana observability for it.
2025 – now
Production platform, streaming & AI
Hardened the lakehouse into an automated multi-environment platform running above three nines. Added a Kafka streaming platform, took on-call and HA/DR, and built AI tooling on top: an LLM data steward and an MCP server.

Work Experience

RR Donnelley (San Francisco Bay Area – Remote)
August 2022 – Present
May 2024 · Present
Senior Data Engineer (Data Platform)
I work on our enterprise Data Mesh platform team, focusing on the infrastructure and services that power analytics and customer facing SaaS workloads through Dremio.
Deploy and operate Dremio lakehouse clusters across multiple Kubernetes environments using Helm and GitLab CI/CD.
Maintain and improve Kubernetes infrastructure that supports enterprise analytics workloads.
Implement and maintain SSO and OIDC based authorization for Dremio and Superset in collaboration with identity and security teams.
Troubleshoot distributed system issues across Kubernetes networking, Hadoop/HDFS, Hive, Snowflake, and catalog services.
Integrate secure APIs into the internal Data Mesh platform so users can run and retrieve queries through the UI.
Migrate storage and catalog components during platform upgrades and improve Helm and Kubernetes configuration for maintainability.
Improve platform observability by integrating Prometheus and Thanos metrics and expanding Grafana dashboards.
Support data governance by improving Collibra domain structures and optimizing metadata sync jobs.
Build Spark, Iceberg, and Polaris examples to support analytics and engineering use cases.
Share platform knowledge through internal documentation and Data Mesh Guild walkthroughs.
Aug 2022 · May 2024
Data Engineer (Data Platform)
Worked on the company's Data Mesh platform supporting governed data access, analytics enablement, and internal data products.
Built a centralized data catalog that provides a unified view of enterprise data assets across teams.
Designed the data mesh API layer and built MuleSoft proxy APIs over internal RPC services, collapsing a three-step auth flow into one authenticated REST call across dev, QA, and prod.
Integrated internal web applications with Data Mesh services to simplify data discovery for users.
Configured ETL pipelines supporting internal consumers and Snowflake Marketplace data products.
Built and launched an internal Next.js application: the team's hub for Data Mesh activity and dataset metrics.
Wrote internal documentation and engineering updates through Data Mesh Guild posts.
Evaluated and deployed new technologies in Kubernetes and OpenStack environments to support architecture decisions.
USAA (Plano, TX – Onsite)
September 2019 – July 2022
Aug 2021 · July 2022
Data Engineer
Internal title: Software Engineer II
Worked on enterprise data pipelines and analytics infrastructure supporting financial forecasting and modeling workloads.
Served as technical lead for a team of five engineers from May to July 2022, leading design discussions, code reviews, and implementation planning.
Built batch data processing pipelines using Domino, R, Python, Git CI/CD, and Airflow.
Led the first team in the organization to migrate business-developed models to IT-supported infrastructure, cutting code by 60 percent and monthly operational effort by 95 percent, roughly $84K in annual savings per model.
Developed ETL pipelines using Python, shell scripts, and IBM DataStage.
Implemented runtime validation and monitoring to maintain data integrity across multiple pipeline stages.
Orchestrated data workflows across Snowflake, Oracle, and Netezza databases.
Built visualizations for financial forecasting data using React, D3, Plotly, and Tableau.
Sep 2019 · Aug 2021
Associate Data Engineer
Internal title: Software Engineer III
Built data pipelines that moved application data into enterprise data warehouse platforms.
Implemented validation controls to maintain data integrity across multiple stages of the pipeline.
Developed ETL jobs using Python, shell scripts, and IBM DataStage.
Taught quarterly internal training sessions on DataStage and ETL development practices.

Education

2024
M.S. Computer Science
Georgia Institute of Technology
2019
B.S. Computer Science
Arizona State University

Toolbox

Tools I reach for

Platforms: DremioConfluentSnowflakeSupersetCollibraMuleSoft
APIs: RESTGraphQLOAuth / OIDCApache Thrift
Streaming & data: Apache KafkaKafka ConnectSchema RegistryDebeziumApache IcebergSpark
Infra: KubernetesHelmHelmfileDockerGitLab CI/CDF5 BIG-IP
Observability: PrometheusThanosGrafana
Languages: PythonSQLTypeScriptBash

How it's landed

How I got here

B.S. and a first data job

USAA: data engineering

M.S. at night, full time by day

Joined the data mesh team

Cataloging, APIs, and the internal portal

Lakehouse, from scratch

Production platform, streaming & AI

Work Experience

Education

Tools I reach for