// Principal Data Engineer

Ashish Kumar


Designing Data Systems That Scale


I'm a Principal Data Engineer with 11+ years of experience designing and delivering enterprise-grade data platforms across fintech and analytics verticals. Currently at Sigmoid Analytics, Bangalore, I lead architecture decisions for large-scale data infrastructure spanning ingestion, transformation, and serving layers.

My work sits at the intersection of engineering rigour and business impact — building pipelines that are not just performant, but observable, governable, and maintainable. I'm opinionated about data contracts, schema evolution, and treating data infrastructure like a product.

Location: Bangalore, India
Degree: B.Tech, GIET Gunupur
Status: Open to Consulting
11+ Years Exp.
2 Cloud Platforms
Pipelines Shipped

The Full Data Lifecycle

Tools and platforms I use across every layer of the data engineering stack.

Ingestion: Apache Kafka, AWS Kinesis, Azure Event Hubs, Debezium, Fivetran
⚙️ Processing: Apache Spark, PySpark, AWS EMR, Azure Databricks, Flink
🗄️ Storage: AWS S3, Azure Data Lake, Delta Lake, Snowflake, Apache Iceberg
🔄 Transformation: dbt, Spark SQL, SQL, Python
🎼 Orchestration: Apache Airflow, AWS MWAA, Prefect, dbt Cloud
☁️ Cloud: AWS, Azure, Glue, Redshift, Synapse
🤖 AI / ML: MLflow, SageMaker, Feature Store, Vector DBs
🛡️ Governance: Apache Atlas, Unity Catalog, Glue Catalog, Great Expectations

Case Studies

High-level architectural patterns I've designed and implemented at scale.

Pattern 01

Lambda Architecture

Hybrid batch + streaming design for low-latency analytics with historical reprocessing capability.

graph LR
    S([Data Source]) --> B[Batch Layer\nSpark / EMR]
    S --> SP[Speed Layer\nKafka + Flink]
    B --> SV[(Serving Layer\nSnowflake)]
    SP --> SV
    SV --> C([Consumers\nBI / APIs])
    style S fill:#21262d,stroke:#58a6ff,color:#e6edf3
    style C fill:#21262d,stroke:#3fb950,color:#e6edf3
    style SV fill:#21262d,stroke:#d2a8ff,color:#e6edf3
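As a rough illustration of how the serving layer can combine the two paths, here is a minimal plain-Python sketch. It is not taken from any production system: the `Event` type, the `BATCH_CUTOFF` value, and the `build_view` and `serve` helpers are hypothetical names chosen for the example, and in-memory counters stand in for Spark, Flink, and Snowflake.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    key: str
    ts: int       # event time, epoch seconds
    amount: int


def build_view(events):
    """Sum amounts per key; stands in for either layer's compute job."""
    view = Counter()
    for event in events:
        view[event.key] += event.amount
    return view


def serve(batch_view, speed_view, key):
    """Serving-layer read: batch total plus the not-yet-batched delta."""
    return batch_view.get(key, 0) + speed_view.get(key, 0)


events = [
    Event("acct-1", 100, 50),
    Event("acct-2", 150, 20),
    Event("acct-1", 220, 5),
    Event("acct-1", 260, 7),
]

# Assumption: the last batch run covered every event with ts < BATCH_CUTOFF.
BATCH_CUTOFF = 200

batch_view = build_view(e for e in events if e.ts < BATCH_CUTOFF)
speed_view = build_view(e for e in events if e.ts >= BATCH_CUTOFF)

total_acct_1 = serve(batch_view, speed_view, "acct-1")

# Historical reprocessing: rebuilding the batch view over the full history
# should agree with the merged batch + speed answer.
reprocessed = build_view(events)
```

The key design point is the cutoff: each event belongs to exactly one of the two views, so a merged read never double-counts.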

Pattern 02

Modern Lakehouse

Delta Lake-based medallion architecture (Bronze → Silver → Gold) with dbt transformation layer.

graph TD
    I[Kafka / Kinesis\nIngestion] --> B[(Bronze\nRaw S3 / ADLS)]
    B -->|Spark ETL| SI[(Silver\nDelta Lake)]
    SI -->|dbt models| G[(Gold\nAnalytics Layer)]
    G --> BI[BI Tools\nPower BI / Tableau]
    G --> API[Data APIs]
    style I fill:#21262d,stroke:#f97316,color:#e6edf3
    style B fill:#21262d,stroke:#cd7f32,color:#e6edf3
    style SI fill:#21262d,stroke:#c0c0c0,color:#e6edf3
    style G fill:#21262d,stroke:#ffd700,color:#e6edf3
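The Bronze → Silver → Gold flow can be sketched in plain Python to show what each hop is responsible for. This is an illustrative toy, not the Spark or dbt code itself: the sample order records and the `to_silver` and `to_gold` helpers are assumptions made up for the example.

```python
from collections import defaultdict

# Bronze: raw records exactly as ingested (strings, duplicates, bad rows).
bronze = [
    {"order_id": "1", "country": "in", "amount": "120.50"},
    {"order_id": "1", "country": "in", "amount": "120.50"},        # duplicate delivery
    {"order_id": "2", "country": "US", "amount": "80"},
    {"order_id": "3", "country": "us", "amount": "not-a-number"},  # malformed amount
]


def to_silver(rows):
    """Silver: typed, validated, deduplicated on order_id."""
    seen, silver = set(), []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # a real pipeline would likely quarantine this row
        if row["order_id"] in seen:
            continue
        seen.add(row["order_id"])
        silver.append(
            {
                "order_id": int(row["order_id"]),
                "country": row["country"].upper(),
                "amount": amount,
            }
        )
    return silver


def to_gold(rows):
    """Gold: business-level aggregate, here revenue per country."""
    revenue = defaultdict(float)
    for row in rows:
        revenue[row["country"]] += row["amount"]
    return dict(revenue)


silver = to_silver(bronze)
gold = to_gold(silver)
```

Keeping Bronze untouched means Silver and Gold can always be rebuilt from it if a cleaning rule changes.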

Pattern 03

Real-time Streaming Pipeline

Event-driven ingestion with PySpark Structured Streaming, micro-batch processing, and alerting.

graph LR
    E([Event Sources]) --> K[Apache Kafka\nTopic]
    K -->|PySpark\nStructured Streaming| P[Processing\nEnrichment]
    P --> DL[(Delta Lake\nSink)]
    P --> AL[Alerting\nPagerDuty / SNS]
    DL --> Q[Query Layer\nDatabricks SQL]
    style E fill:#21262d,stroke:#58a6ff,color:#e6edf3
    style K fill:#21262d,stroke:#f97316,color:#e6edf3
    style AL fill:#21262d,stroke:#f85149,color:#e6edf3
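To make the micro-batch idea concrete without a Spark cluster, here is a small plain-Python sketch of the same shape: read events in fixed-size batches, enrich each one, write it to a sink, and raise an alert on a threshold. Everything named here (`micro_batches`, `process`, `DEVICE_SITE`, `TEMP_LIMIT`, the sample events) is hypothetical; lists stand in for the Kafka topic, the Delta Lake sink, and the alerting channel.

```python
from itertools import islice


def micro_batches(stream, size):
    """Yield successive lists of up to `size` events from an iterable."""
    iterator = iter(stream)
    while batch := list(islice(iterator, size)):
        yield batch


# Assumed enrichment lookup and alert threshold, for illustration only.
DEVICE_SITE = {"d1": "blr", "d2": "hyd"}
TEMP_LIMIT = 80


def process(batch, sink, alerts):
    """Enrich each event, append it to the sink, and alert when over the limit."""
    for event in batch:
        site = DEVICE_SITE.get(event["device"], "unknown")
        enriched = {**event, "site": site}
        sink.append(enriched)
        if enriched["temp"] > TEMP_LIMIT:
            alerts.append(f"{enriched['device']}@{site}: temp {enriched['temp']}")


events = [
    {"device": "d1", "temp": 71},
    {"device": "d2", "temp": 93},
    {"device": "d3", "temp": 65},
]

sink, alerts = [], []
for batch in micro_batches(events, size=2):
    process(batch, sink, alerts)
```

In this sketch alerting sits on the processing path rather than behind the sink, mirroring the diagram, so an alert does not wait on the query layer.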

Professional Journey

Sigmoid Analytics 2022 — Present
Principal Data Engineer
📍 Bangalore, India
  • Lead data platform architecture for enterprise clients, defining standards for ingestion, transformation, and data quality.
  • Designed and implemented lakehouse solutions on Azure Databricks and AWS EMR serving petabyte-scale workloads.
  • Mentored a team of 7 engineers on distributed systems design, dbt modelling, and data observability practices.
Oracle Finance 2018 — 2022
Data Engineer
📍 Bangalore, India
  • Built and maintained financial data pipelines processing millions of transactions daily using Apache Spark and Oracle Cloud.
  • Implemented data governance frameworks ensuring regulatory compliance and data lineage tracking.
  • Collaborated cross-functionally to deliver self-service analytics capabilities to business stakeholders.
GIET University 2010 — 2014
B.Tech, Computer Science
📍 Gunupur, Odisha
  • Gandhi Institute of Engineering & Technology, Odisha.
  • Foundation in algorithms, distributed systems, and database design.

Technical Articles

Deep-dives on data architecture, platform engineering, and the modern data stack.

Let's Build Something

Whether you're looking to modernize a legacy data platform, architect a new lakehouse, or need a technical lead for your data engineering team — I'm open to the conversation.

Available for consulting & architecture reviews