Data and Platform Engineer focused on building reliable and scalable data platforms on AWS and GCP. I care a lot about automation and infrastructure as code.
I enjoy designing systems end to end: real-time streaming, batch processing, data transformation, orchestration, and analytics. Before moving into data engineering, I spent six years in real estate, which helps me bring real business understanding to technical problems.
Right now I design and run a complete production-grade data platform on AWS. All the main pieces live together in my dedicated GitHub organization: `enterprise-data-platform-emeka`
These projects work together as one system:
- `terraform-platform-infra-live` - Full AWS infrastructure including VPC, S3, Glue, Redshift, MWAA, and IAM
- `platform-orchestration-mwaa-airflow` - Airflow DAGs for orchestration
- `platform-glue-jobs` - Bronze to Silver Spark ETL jobs
- `platform-dbt-analytics` - Silver to Gold dbt transformations
- `platform-cdc-simulator` - CDC event generator
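To give a sense of what the CDC simulator feeds into the pipeline, here is a minimal sketch of a Debezium-style change event envelope. The field names and shape are illustrative assumptions, not the exact schema the `platform-cdc-simulator` repo emits:

```python
import json
import uuid
from datetime import datetime, timezone

def make_cdc_event(table: str, op: str, row: dict) -> dict:
    """Build a Debezium-style CDC envelope (hypothetical shape;
    the real simulator's schema may differ)."""
    return {
        "event_id": str(uuid.uuid4()),
        "table": table,
        "op": op,  # "c" = insert, "u" = update, "d" = delete
        "ts": datetime.now(timezone.utc).isoformat(),
        "after": row if op != "d" else None,   # new row state
        "before": row if op == "d" else None,  # prior state on delete
    }

event = make_cdc_event("orders", "c", {"order_id": 1, "amount": 42.5})
print(json.dumps(event, indent=2))
```

Events in this shape land in the S3 Bronze layer as raw Parquet, where the downstream Glue job picks them up.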
```mermaid
flowchart TD
    subgraph Source ["Source Layer"]
        direction TB
        Postgres[PostgreSQL RDS\nWAL Log] --> DMS[AWS DMS CDC]
        DMS --> S3Raw[S3 Bronze\nRaw CDC Parquet]
    end
    subgraph Processing ["Processing Layer"]
        direction TB
        Glue[Glue PySpark\nBronze to Silver] --> Silver[S3 Silver\nCleaned Parquet]
        Silver --> DBT[dbt + Athena\nSilver to Gold]
        DBT --> Gold[S3 Gold\nAggregated Parquet]
    end
    subgraph Serving ["Serving Layer"]
        direction TB
        Redshift[Redshift Serverless] --> BI[BI Dashboards]
    end
    subgraph Analytics ["Natural Language Analytics Agent"]
        direction TB
        NLQ[User NL Question] --> Agent[Analytics Agent\nECS Fargate + Claude API]
        Agent --> SchemaRes[Schema Resolver\nGlue Catalog + dbt artifacts]
        SchemaRes --> SQLGen[SQL Generator\nPartition-aware Athena SQL]
        SQLGen --> Guardrails[Guardrails\nSELECT-only, cost check]
        Guardrails --> Exec[Athena Execution]
        Exec --> Validate[Result Validator\nSanity checks]
        Validate --> Output[Chart + Insight + SQL\nAssumptions flagged]
    end
    S3Raw --> Glue
    MWAA[MWAA Airflow\nOrchestration] -->|triggers| Glue
    MWAA -->|triggers dbt| DBT
    Glue -.->|invalid records| Quarantine[S3 Quarantine]
    Gold --> Redshift
    Gold --> SchemaRes
    Exec -->|queries| Gold
    DBT -.->|uploads dbt artifacts| SchemaRes
    classDef layer fill:#f0f4f8,stroke:#333,stroke-width:2px;
    class Source,Processing,Serving,Analytics layer;
```
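The guardrails step in the analytics agent is the part most worth illustrating: before any generated SQL reaches Athena, it must be SELECT-only and within a scan-cost budget. Here is a minimal sketch of that check; the keyword list, function names, and the 10 GB limit are illustrative assumptions, not the production values:

```python
import re

# Statements that mutate data or schema are rejected outright.
BLOCKED = re.compile(
    r"\b(insert|update|delete|drop|alter|create|grant|truncate|merge)\b",
    re.IGNORECASE,
)

def passes_guardrails(sql: str, est_scanned_gb: float, scan_limit_gb: float = 10.0) -> bool:
    """Allow only plain SELECTs whose estimated Athena scan stays
    within the cost budget (limit chosen here for illustration)."""
    stripped = sql.strip().rstrip(";")
    if not stripped.lower().startswith("select"):
        return False
    if BLOCKED.search(stripped):
        return False
    return est_scanned_gb <= scan_limit_gb

print(passes_guardrails("SELECT region, SUM(amount) FROM gold.sales GROUP BY region", 2.0))  # True
print(passes_guardrails("DROP TABLE gold.sales", 0.0))  # False
```

Anything that fails the check is returned to the SQL generator instead of being executed, which keeps the agent from ever mutating data or running away on cost.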
I also have several public repositories that show my work across different tools and domains:
- Databricks Asset Bundles + Real Estate Pipeline: End-to-end ELT on GCP with Delta Live Tables and medallion architecture
- Real Estate Valuation Pipeline: Built with dbt Fusion, Snowflake, and AWS S3
- Airflow + dbt + BigQuery Healthcare Pipeline: Full orchestration and transformation on Google Cloud
- AWS Terraform Data Platform: Infrastructure as code for S3 data lake, Glue, Athena, and CI/CD
- Fraud Detection and Sales Analytics Pipelines: Using dbt, Snowflake, and Tableau
These projects complement my main enterprise platform and show how I apply modern data engineering in practice.
Tech stack:

- Pipelines and Processing: dbt, Apache Kafka, Databricks, Glue, Spark
- Cloud and Infrastructure: AWS (S3, Glue, Athena, Redshift, IAM), Terraform, GCP
- Orchestration: Apache Airflow (MWAA), GitHub Actions
- Languages: Python, SQL
- Visualization: Power BI, Tableau, Looker, QuickSight
What I focus on:

- Building layered data platforms (raw, curated, and analytics layers)
- Streaming and batch ELT workflows
- Infrastructure as code with proper CI/CD
- Automation and reliability at scale
- Using domain knowledge to solve real business problems
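The core move in a layered (medallion) platform is collapsing raw Bronze CDC events into clean Silver state. The actual job runs as PySpark on Glue with a window over the business key; this is a pure-Python sketch of the same idea, with illustrative field names:

```python
from itertools import groupby
from operator import itemgetter

def latest_per_key(events: list[dict], key: str = "order_id", ts: str = "ts") -> list[dict]:
    """Keep only the latest CDC event per business key and drop
    deletes -- a stand-in for the Glue job's window-function logic."""
    rows = sorted(events, key=itemgetter(key, ts))
    latest = [list(group)[-1] for _, group in groupby(rows, key=itemgetter(key))]
    return [r for r in latest if r.get("op") != "d"]

bronze = [
    {"order_id": 1, "ts": "2024-01-01T10:00:00Z", "op": "c", "amount": 10.0},
    {"order_id": 1, "ts": "2024-01-01T11:00:00Z", "op": "u", "amount": 12.5},
    {"order_id": 2, "ts": "2024-01-01T10:30:00Z", "op": "c", "amount": 99.0},
    {"order_id": 2, "ts": "2024-01-01T12:00:00Z", "op": "d", "amount": 99.0},
]
silver = latest_per_key(bronze)
print(silver)  # order 1 survives at its updated amount; order 2 was deleted
```

From there, dbt models aggregate Silver into Gold tables that Redshift and the analytics agent both read.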
Visit my YouTube channel to see project demos: @Data_Pipeline_Lab


