Data Engineer | Azure & Big Data Enthusiast
I build scalable, production-grade data pipelines that turn raw information into business value. With a background in backend operations, I bridge the gap between technical infrastructure and business efficiency.
| Category | Technologies |
|---|---|
| Big Data & Cloud | PySpark, Databricks, Azure Data Factory (ADF), ADLS Gen2 |
| Architecture | Medallion Architecture, Delta Lake, Strategy Design Pattern |
| Automation | Azure DevOps, CI/CD, ARM Templates, Logic Apps |
| Programming | Python, SQL |
| Database | Azure SQL, MySQL, PostgreSQL |
These projects demonstrate my ability to build modular, maintainable, and production-ready data systems.
Orchestrated a production-grade, event-driven pipeline implementing Medallion Architecture.
- Key Features: Incremental loading with watermark patterns (90% performance gain), CI/CD automation via Azure DevOps, and automated failure alerts using Logic Apps.
Production-grade data processing for large-scale datasets.
- Key Features: Processed 46M+ records using PySpark, implemented the Strategy Design Pattern for modular transformation logic, and enforced strict data quality contracts.
Backend Operations & Data-Driven Support I leverage my operational background to build data systems that solve real-world business bottlenecks.
- Operations Specialist (FiveS Digital): Analyzed CRM data to optimize backend reporting, reducing resolution time by 40% through data-driven prioritization.
- Operations Analyst (Diallo): Leveraged data segmentation to target high-probability accounts, achieving a 30%+ increase in recovery rates.
I am always open to discussing Data Engineering, Cloud Architecture, or new opportunities.
- 📧 Email: ajmalkhan88083@gmail.com
- 💼 LinkedIn: linkedin.com/in/mdajmalkhan
- 🐙 GitHub: github.com/ajmal-khan2002
“I build pipelines that are as clean as the code that powers them.”
