12/18/2025 updated


100 % available
Data Engineer | Databricks | Microsoft Fabric | ADF | Synapse | Certified Data Engineer
Noida, India Masters of Technology in Computer Science (ML & AI)
About me
Data Engineer with expertise in Python, SQL, Databricks, and Power BI. Experienced in building end-to-end ETL pipelines, medallion architecture, PySpark transformations, dashboard automation, and LLM-driven analytics to convert complex data into actionable insights.
Data Engineering and Pipeline Development
Advanced expertise in designing and building automated ETL pipelines, data transformation using SQL and Python, and implementing scalable data architectures with Azure Databricks and cloud platforms
Business Intelligence and Data Visualization
Proficient in creating interactive dashboards and reports using Power BI, Fabric, and Azure Databricks for data-driven decision making and insight generation
Machine Learning and Predictive Analytics
Experience in developing predictive models, quantitative analysis, and implementing machine learning solutions for forecasting trends and business optimization
Programming and Development
Strong programming skills in Python, SQL, and VBA for data processing and automation solutions
Cloud Platforms and DevOps
Experience with Azure DevOps, Git, and cloud-based data management systems for efficient deployment and version control
Big Data Analytics Tools
Proficiency in handling large datasets using specialized analytics tools and platforms for enterprise-level data processing
Database Management
Expertise in SQL Server, Microsoft Excel, and various database technologies for data storage and retrieval optimization
Data Security and Compliance
Knowledge of secure encryption protocols and data security compliance measures for enterprise environments
Advanced expertise in designing and building automated ETL pipelines, data transformation using SQL and Python, and implementing scalable data architectures with Azure Databricks and cloud platforms
Business Intelligence and Data Visualization
Proficient in creating interactive dashboards and reports using Power BI, Fabric, and Azure Databricks for data-driven decision making and insight generation
Machine Learning and Predictive Analytics
Experience in developing predictive models, quantitative analysis, and implementing machine learning solutions for forecasting trends and business optimization
Programming and Development
Strong programming skills in Python, SQL, and VBA for data processing and automation solutions
Cloud Platforms and DevOps
Experience with Azure DevOps, Git, and cloud-based data management systems for efficient deployment and version control
Big Data Analytics Tools
Proficiency in handling large datasets using specialized analytics tools and platforms for enterprise-level data processing
Database Management
Expertise in SQL Server, Microsoft Excel, and various database technologies for data storage and retrieval optimization
Data Security and Compliance
Knowledge of secure encryption protocols and data security compliance measures for enterprise environments
Languages
EnglishFluent
Project history
• Patent Data : Designed and built an automated ETL pipeline to ingest U.S. Patent Office data by downloading ZIP files, extracting XML, parsing required fields, converting to CSV, and loading the curated dataset into SQL tables,improving data availability and processing efficiency.
• PitchBook Startup Data : Developed a full medallion-architecture data pipeline for PitchBook startup datasets,
including ingestion to Bronze, complex transformations to Silver, and business-ready Gold tables; enabled insights by building an interactive Power BI dashboard for end-user consumption.
• CPO Buyer Needs Generation : Engineered an LLM-powered data pipeline in Databricks that processes analyst discussion summaries and customer questions to generate buyer-need recommendations using GPT endpoints,significantly enhancing Gartner’s content intelligence capabilities.
Certificates
DP-600
Microsoft2025