Ankush Handa
I have developed and managed data pipelines for various clients. I worked with one of the healthcare client wherein i was required to transform the data accumulated from different sources into a semi structured format which can be consumed by various downstream applications.
5 yrs 10 mos
Data Engineer
Boeing
1 month
IIIT Bengaluru
Summary
I am looking for opportunities where i can enhance my skill sets so i can justify with my experience and expected role from the company. I am highly enthusiast in machine learning. I have good experience in development of various ML models. I am seeking opportunity in same domain to get expertise.
... See More
Skills
Nothing specified
Career Journey (5 yrs 10 mos)
Data Engineer
Boeing
Dec 2018 - Present
2 yrs 1 mo
Description: 
TQA is the Total Quantity of the part in all the Next Higher assemblies (NHA) per Aircraft. The TQA specifies the number of units of the subject part (in implied units of each) used in the complete assembly of one aircraft for line maintenance purposes. Our task was to improve the performance of TQA workflow for TQA and QPA calculation which will be used by various downstream airline applications for demand planning and contract proposals. My responsibilities involved: 1. Requirement gathering, designing, documentation & analysis of the key deliverables. 2. Performing transformations in Apache spark for TQA calculation serving various downstream applications. 3. Automating the entire process of ingestion and curation with scheduling in oozie. 4. Designed and developed metrics for cluster usage for different data sources that will be used by the business for making appropriate decisions. 5. Developed on scripts to transfer data from on premise to cloud. 6. Worked on Generic Ingestion API, to load the data into Data Lake through which data scientists can ingest adhoc data into Data lake without involving data engineers.(POC)
... See More
IIIT Bengaluru
Post Graduate Diploma
Dec 2018 - Nov 2019
10 mos

Specialization
Degree: Post Graduate Diploma
Category: null
... See More
Business Technology Analyst
Deloitte
Oct 2017 - Dec 2018
1 yr 1 mo
Description: 
A US based Health Insurance major over a period of 5 years has developed some very reliable predictive models based on AI and ML. They aimed to commercialize those applications to third party insurers at some cost. Deloitte was assigned the task to setup a cloud based commercial data lake where data will be ingested from different sources and different vendors. The data in end will be consumed by ML models to predict the industry trends. My responsibilities involved: 1. Creating the data pipeline in Apache Hadoop, Python and Spark to ingest reference table data from different source into the data lake. 2. Creating Python and Unix Shell Scripts to automate the object creation tasks and metadata for object creation task. 3. Designing and developing the complete data pipeline in Apache Spark, Scala and Python to convert the data from commercial data lake into semi-structured format in JSON and provide it for ML Data Model based applications. 4. Developed the data pipeline logic for same load to provide data feed to downstream application as needed based on Slowly Changing Dimensions concepts. 5. Designed and developed the automated SIT testing framework using python. 6. Implemented a generic data quality check framework. 7. Developed deployment scripts to automate the deployment process and integration with Devops.
... See More
Application Development Analyst
Accenture
Jan 2015 - Oct 2017
2 yrs 8 mos
Description: 
An enterprise Data-warehouse solution implemented in Big Data for a banking client. Solution aimed at replacing existing legacy system with new age data analytics technologies to make the system more versatile. Developed and designed by Accenture UK and Accenture India. 1. Requirement gathering, designing, documentation & analysis of the key deliverables for the system migration. 2. Proficient in writing complex hive queries while implementing concepts such as bucketing, partitioning with Array, Maps & Structs in Hive. 3. Designed and developed Scala APIs using Apache Spark to load data with data validation and data quality rules. 4. Designed and developed Scala codes to perform ETL on data and load data in slowly changing dimension formats in Hive Tables. 5. Designed and developed the workflow to automate the service-now defect registering portal for assignment of incidents and thus removing the manual efforts as a part of 24*7 monitoring in order to avoid missing the SLA.
... See More
Kurukshetra University
B.Tech
Sep 2010 - Jul 2014
3 yrs 10 mos

Specialization
Degree: B.Tech
Category: null
... See More
More
Nothing specified
Personality
Nothing specified