About & résumé

A bit more about me

I'm a data platform engineer on a data mesh team at a marketing technology company. The work sits between data engineering, platform engineering, and distributed systems. Right now that means a Dremio lakehouse, a Kafka platform on Confluent for Kubernetes, and an Iceberg catalog that pulls on-prem and cloud data into one place.

Most of the job is figuring out how the pieces fit, and why they behave the way they do when they break. I'd rather own a whole system than ship a feature and walk away: the upgrades, the on-call, the unglamorous parts that decide whether anyone trusts it. Outside work I'm usually on a mountain bike somewhere around the Bay Area.

Download PDF

Recognition

How it's landed

  • Earned the top annual performance rating in 2023 ("sets a new standard," the highest on the scale), plus an internal innovation-award nomination.
  • Rated "exceeds expectations" in 2024 for taking the lakehouse to production and holding it above three nines.
  • The person the team hands complex, cross-org infrastructure work to, and the technical liaison to DevOps, networking, and cloud.

Trajectory

How I got here

  1. 2019

    B.S. and a first data job

    Finished a B.S. in Computer Science at Arizona State and started at USAA as a data engineer in Plano, Texas.

  2. 2019 – 2022

    USAA: data engineering

    Built ETL pipelines and runtime data-integrity controls in Python, IBM DataStage, and Airflow across Snowflake, Oracle, and Netezza. Led the first team in the org to move business-built models onto IT-supported infrastructure: 60% less code, 95% less monthly effort, about $84K saved per model a year. Tech-led a team of five toward the end.

  3. 2020 – 2024

    M.S. at night, full time by day

    Started an M.S. in Computer Science at Georgia Tech in 2020 and finished in 2024, taking classes at night across both jobs while working full time.

  4. 2022

    Joined the data mesh team

    Moved to RR Donnelley to help build a new internal data platform from the ground up.

  5. 2022 – 2024

    Cataloging, APIs, and the internal portal

    Built the cataloging and lineage backbone (crawlers that load thousands of assets into the governance catalog), a data-product API layer, and the team's internal web portal.

  6. 2024

    Lakehouse, from scratch

    Stood up the first Dremio lakehouse on Kubernetes with an Iceberg catalog and object storage. Wired in Kerberized HDFS and federated sources, and built the first Prometheus and Grafana observability for it.

  7. 2025 – now

    Production platform, streaming & AI

    Hardened the lakehouse into an automated multi-environment platform running above three nines. Added a Kafka streaming platform, took on-call and HA/DR, and built AI tooling on top: an LLM data steward and an MCP server.

Work Experience

  • RR Donnelley
    RR Donnelley (San Francisco Bay Area – Remote)
    August 2022 – Present
    May 2024 · Present
    Senior Data Engineer (Data Platform)

    I work on our enterprise Data Mesh platform team, focusing on the infrastructure and services that power analytics and customer facing SaaS workloads through Dremio.

    • Deploy and operate Dremio lakehouse clusters across multiple Kubernetes environments using Helm and GitLab CI/CD.
    • Maintain and improve Kubernetes infrastructure that supports enterprise analytics workloads.
    • Implement and maintain SSO and OIDC based authorization for Dremio and Superset in collaboration with identity and security teams.
    • Troubleshoot distributed system issues across Kubernetes networking, Hadoop/HDFS, Hive, Snowflake, and catalog services.
    • Integrate secure APIs into the internal Data Mesh platform so users can run and retrieve queries through the UI.
    • Migrate storage and catalog components during platform upgrades and improve Helm and Kubernetes configuration for maintainability.
    • Improve platform observability by integrating Prometheus and Thanos metrics and expanding Grafana dashboards.
    • Support data governance by improving Collibra domain structures and optimizing metadata sync jobs.
    • Build Spark, Iceberg, and Polaris examples to support analytics and engineering use cases.
    • Share platform knowledge through internal documentation and Data Mesh Guild walkthroughs.
    Aug 2022 · May 2024
    Data Engineer (Data Platform)

    Worked on the company's Data Mesh platform supporting governed data access, analytics enablement, and internal data products.

    • Built a centralized data catalog that provides a unified view of enterprise data assets across teams.
    • Designed the data mesh API layer and built MuleSoft proxy APIs over internal RPC services, collapsing a three-step auth flow into one authenticated REST call across dev, QA, and prod.
    • Integrated internal web applications with Data Mesh services to simplify data discovery for users.
    • Configured ETL pipelines supporting internal consumers and Snowflake Marketplace data products.
    • Built and launched an internal Next.js application: the team's hub for Data Mesh activity and dataset metrics.
    • Wrote internal documentation and engineering updates through Data Mesh Guild posts.
    • Evaluated and deployed new technologies in Kubernetes and OpenStack environments to support architecture decisions.
  • USAA
    USAA (Plano, TX – Onsite)
    September 2019 – July 2022
    Aug 2021 · July 2022
    Data Engineer
    Internal title: Software Engineer II

    Worked on enterprise data pipelines and analytics infrastructure supporting financial forecasting and modeling workloads.

    • Served as technical lead for a team of five engineers from May to July 2022, leading design discussions, code reviews, and implementation planning.
    • Built batch data processing pipelines using Domino, R, Python, Git CI/CD, and Airflow.
    • Led the first team in the organization to migrate business-developed models to IT-supported infrastructure, cutting code by 60 percent and monthly operational effort by 95 percent, roughly $84K in annual savings per model.
    • Developed ETL pipelines using Python, shell scripts, and IBM DataStage.
    • Implemented runtime validation and monitoring to maintain data integrity across multiple pipeline stages.
    • Orchestrated data workflows across Snowflake, Oracle, and Netezza databases.
    • Built visualizations for financial forecasting data using React, D3, Plotly, and Tableau.
    Sep 2019 · Aug 2021
    Associate Data Engineer
    Internal title: Software Engineer III
    • Built data pipelines that moved application data into enterprise data warehouse platforms.
    • Implemented validation controls to maintain data integrity across multiple stages of the pipeline.
    • Developed ETL jobs using Python, shell scripts, and IBM DataStage.
    • Taught quarterly internal training sessions on DataStage and ETL development practices.

Education

  • Georgia Institute of Technology
    2024
    M.S. Computer Science
    Georgia Institute of Technology
  • Arizona State University
    2019
    B.S. Computer Science
    Arizona State University

Toolbox

Tools I reach for

Platforms
DremioConfluentSnowflakeSupersetCollibraMuleSoft
APIs
RESTGraphQLOAuth / OIDCApache Thrift
Streaming & data
Apache KafkaKafka ConnectSchema RegistryDebeziumApache IcebergSpark
Infra
KubernetesHelmHelmfileDockerGitLab CI/CDF5 BIG-IP
Observability
PrometheusThanosGrafana
Languages
PythonSQLTypeScriptBash