Software developer with 6.5 years of experience in the design, development, testing, and implementation of stand-alone and client-server enterprise applications using Python 3.7, PySpark, and Java, with involvement in data analysis, mapping, translation, and transformation. Good working knowledge of PySpark, Databricks, big data pipeline implementation, and database technologies for designing server applications and client interfaces.
Overview
9 years of professional experience
Work History
Data Engineer
Cigna Group
11.2022 - Current
Worked as an AWS, PySpark & Python Developer
Developed Glue ETL, Lambda and Python scripts
Used AWS components such as DynamoDB, Step Functions, SQS, and SNS
Worked in the provider domain to handle and engineer provider data such as organization, facility, group, practitioner, network, and payment data
Developed ETL pipelines using Databricks, PySpark and AWS Cloud Services
Designed and implemented a solution to manage big data in the Databricks cluster
Developed PySpark modules on Databricks to process raw practitioner JSON from an S3 bucket and convert it into Delta Lake tables
Developed Airflow scripts for scheduling and orchestration
Used Terraform for deployment, and Docker and Jenkins to create and run unit test and functional test automation pipelines.
Python AWS Developer
Cigna Healthcare
11.2021 - 11.2022
Designed and developed data pipelines using PySpark on Databricks to process healthcare practitioner and organization data and load it into AWS S3
Worked with AWS services such as S3, Step Functions, Lambda, and API Gateway, as well as Airflow jobs and logs
Constructed an ETL framework for optimization in the cloud using SparkContext, Spark SQL, DataFrames, and RDDs
Created Lambda functions to trigger and process data once source files become available in S3, and set up SQS to queue the data and SNS to send notifications
Understanding of real-time data integration using Kafka & Spark streaming
Experienced in writing unit tests using the PyTest framework
Developed CI/CD pipelines using GitHub, Jenkins, and Terraform
Monitored the status of the non-prod CI/CD pipelines in AWS and fixed failures
Used Apache Airflow for orchestration and monitoring of jobs.
Python Developer
Capgemini Technology Services India Limited, BNP PARIBAS
06.2018 - 05.2019
Worked in banking IT services on a tax project called STARR (System of Tax Reclaim and Relief), built with Spring Boot, Django, and AngularJS
Involved in the design and development of business requirements specified by each tax authority present in different countries
Calculated and settled dividend tax reclaims after processing beneficiary and stock details, generating tax forms for 100+ countries
Developed views and templates with the Python Django framework and created a user-friendly website feature to calculate and report tax refund amounts
Developed frontend and backend for refund systems using Python on Django
Output the parsed data as JSON and loaded it into MongoDB
Worked with JSON based REST Web services
Used Python data types such as dictionaries and tuples, along with inheritance, to build network algorithms for tax amount calculation in the STARR system
Wrote and executed various MySQL database queries from Python using the Python MySQL connector and the MySQLdb package
Managed large datasets using pandas DataFrames and MySQL
Used Git for version control and to track changes in the code
Used RabbitMQ as the message broker to execute asynchronous tasks
Experience with ETL development, data modeling, testing, and documentation.
Programmer Analyst
Cognizant Technology Solutions
01.2015 - 06.2018
United Parcel Service (UPS): involved in all phases of the software development life cycle (SDLC) using Agile Scrum and Waterfall methodologies
Designed and developed a Smart Compare tool using Python Django and exposed all of its modules as web services
This tool compares large sets of data from two different environments and performs tasks such as file format detection, batch comparison controls, on-demand comparison controls, and comparison report generation
This application was developed to reduce manual effort in the parallel testing phase by 90%
Designed and developed enterprise APIs and was involved in testing them using SoapUI and Postman
Optimized data storage and search queries for healthcare and non-healthcare applications using PL/SQL, improving application speed
Worked with data formats such as EDI, SAP IDocs, XML, CSV, and flat file structures; used MySQL and Oracle databases and stored procedures, and created reports to examine inbound and outbound data
Performed Data translation and Data Mapping on large sets of supply chain data using middleware integration tools and provided data-driven solutions
Worked on web services using standard web protocols and formats such as XML, SOAP, and HTTP
Developed Python web services for processing JSON and interfacing with the Data layer
Involved in automated deployment of the application using Git, and migrated from SVN to GitHub to merge and manage versions efficiently
Worked extensively with the QA team to coordinate the testing process and debug defects
Performed unit and integration tests for order management apps, reducing defects and bugs by 40%
Consolidated and refactored the order management web app, significantly improving code maintainability and increasing performance by 50%.