Sachin is an accomplished Senior Data Engineer and certified Project Management Professional (PMP) with extensive expertise in data engineering, cloud platforms, ETL tools, and advanced analytics. With a strong educational background, including a Doctorate in Healthcare Administration (DHA) and multiple Master's degrees, Sachin has consistently excelled in designing and managing complex data systems across diverse industries such as healthcare, finance, retail, aerospace, and education.
Sachin's technical expertise encompasses advanced tools and platforms, including Google Cloud Platform (GCP), Apache Airflow, IBM DataStage, Apache Spark, and a variety of programming languages like Python, Scala, and SQL. He has a proven track record in building scalable data pipelines, optimizing ETL processes, and delivering impactful data solutions in Agile environments.
With over two decades of experience, Sachin has contributed to high-profile organizations, including Surescripts, Deloitte, and Bank of America, driving data integration, project management, and analytics. His achievements include developing innovative ETL modules, improving performance efficiencies, and spearheading the transition to modern cloud-based solutions.
A detail-oriented professional, Sachin combines technical acumen with deep business knowledge in domains such as healthcare, energy, and customer feedback management. As a U.S. citizen based in Minneapolis, MN, he exemplifies leadership, adaptability, and a commitment to continuous learning in the ever-evolving field of data engineering.
Over the course of his career, Sachin S. Rath has demonstrated exceptional expertise in data engineering, ETL development, and project management, making significant contributions to a variety of organizations across industries. Since June 2018, Sachin has served as a Senior Data Engineer at Surescripts in Minneapolis, MN, where he has played a pivotal role in transitioning legacy systems to modern cloud-based solutions. His work includes the conversion of DataStage jobs and PL/SQL routines into Google Cloud Platform (GCP) codebases, using tools such as Apache Airflow, BigQuery, and Spark-Scala-Python. By developing and enhancing ETL systems, he has streamlined operations and supported cross-functional collaboration between onsite and offshore teams.
Prior to this role, from June 2017 to April 2018, Sachin worked with Deloitte for the Oregon Department of Justice in Salem, OR, where he designed efficient ETL pipelines to modernize legacy databases. His adherence to strict data lineage standards and collaboration with online application teams ensured quality and accuracy in data migration. Earlier, at Martin Marietta in Raleigh, NC, from April 2017 to May 2017, he was instrumental in converting Hadoop queries into DataStage jobs, enhancing the company’s data integration capabilities.
Sachin’s expertise extends to working with leading organizations like USAA in San Antonio, TX, where in March 2017, he documented and analyzed DataStage Parallel jobs for ServiceNow data integration. From October 2016 to February 2017, he contributed as a DataStage Specialist for Bank of America in Pennington, NJ, where he designed complex ETL jobs for the Risk Management Supervision (RMS) system, employing Linux scripts for automation and reconciliation.
At TIAA in Iselin, NJ, Sachin worked as a Senior DataStage Architect between May 2016 and September 2016, where he managed insurance and retirement plan data using advanced DataStage processes and Java APIs. During his time at Accenture (October 2013 to April 2016), he delivered multiple high-impact projects for clients such as TJX and NiSource, leading the design of end-to-end ETL solutions and supporting platforms like PeopleSoft EPM.
Sachin also contributed to critical public-sector projects. At the U.S. Department of Education in Washington, D.C., from July 2013 to September 2013, he managed ETL systems for Federal Student Loan data. Similarly, at Boeing in Ridley Park, PA, from July 2009 to December 2012, he enhanced ETL processes for the company’s Supplier Data Warehouse, achieving significant performance improvements and integrating data from multiple locations.
Earlier roles include his work at Clarabridge Inc. in Reston, VA, from March 2009 to July 2009, where he developed Cognos reports for high-profile clients and optimized Clarabridge Mining Platform (CMP) processes. At NVR Inc., between July 2006 and November 2008, Sachin designed real-time ETL solutions and maintained data warehouses, significantly improving processing times.
Sachin’s career began at Mahindra-Satyam from April 1996 to July 2006, where he gained foundational expertise in data engineering and reporting. His contributions included working on projects for Caterpillar Inc. and Delta Airlines, specializing in Cognos reporting, PeopleSoft integration, and mainframe systems.
Throughout his journey, Sachin has consistently delivered results, whether optimizing data pipelines, transitioning legacy systems, or ensuring the integrity of critical data systems. His vast experience across domains such as healthcare, finance, aerospace, and retail showcases his adaptability and commitment to excellence. His career is a testament to his technical acumen, leadership capabilities, and dedication to innovation in data engineering.
Self-improvement
• Doctor of Healthcare Administration (DHA): 3.67 GPA, Virginia University of Lynchburg (VA), Aug 2022
• MBA: 4.0 GPA (awarded “Outstanding Graduate”), Clarion Univ (PA), May 2013
• Project Management Professional (PMP, March 2002 – March 2027)
• M. S., Comp. Sc.: 4.0 GPA, Bradley University (IL), June 2006
• B. Engg., Comp. Tech.: 3.85 GPA (Univ 1st rank), Nagpur Univ. (India), May 1995
Skills
ETL Tools: IBM DataStage (v11.5/9.1 Parallel & v8.7 Server)
BI Tools: Cognos 10.2.2
Languages and Scripts: Scala, Python, Oracle PL/SQL, BigQuery Script, SQL Server T-SQL, DB2 Stored Proc, COBOL, SAS/SAS SCL, Java, JavaScript, Perl, C++
Databases: GCP BigQuery, Oracle, SQL Server, DB2, Informix, Netezza, Teradata
Miscellaneous: IntelliJ, Apache-Spark, GCP Cloud Composer v2.4.2 – Apache Airflow v2.5.3, LINUX/AIX, ERwin 9.5, E-R Studio 6.5, Sybase Power Designer 15.3, Schedulers: AutoSys/Control-M/SQL Server Agent, PeopleSoft EPM 8.9/HCM 8.3
Training: Hadoop, Tableau, Apache-Spark-Scala-Python, Google Cloud Platform - BigQuery-Apache Airflow-Python, R, Informatica Axon v7.3, Informatica EDC v10.5
Professional Experience
1. Surescripts, Minneapolis, MN (06/2018) – Present
Senior Data Engineer (Full-time employee)
• Coordinated with onsite and offshore (Onix-Datametica) teams to convert DataStage jobs and PL/SQL routines to a GCP codebase of DAGs built with Apache Airflow/Composer (Python) and Google BigQuery
• Built new Airflow DAGs and BigQuery routines and scripts to supplement the converted environment while also making enhancements to existing DAGs and BQ objects
• Built table-to-table and file-to-table data pipelines in an Apache-Spark-Scala-Python environment, using JSON scripts to generate the Control-M jobs that execute the Spark jobs
• Worked in an Agile environment to build and maintain ETL to extract data from Oracle, Impala – Hadoop, and Salesforce sources to load into a data warehouse residing on Oracle 12c as well as on Hadoop
• Developed ETL with Datastage v11.7 Parallel and Oracle PL/SQL as the ETL engines on a Linux platform, with SQL Developer and DbVisualizer as database browsers, SourceTree as a version control tool, and IIS operations console to monitor activity
• Provided production support to existing ETL system
• Converted SQL Server Agent Jobs to Control-M schedules and developed new scheduler jobs
• Spearheaded the installation, testing, and deployment of CONTROL-M as the scheduler, converting SQL Server Scheduler processes to CONTROL-M and establishing standards for Hadoop File Watchers and FTP
• Developed pioneering ETL modules: Type1/2 loads, long-running process monitor, force-stop runaway process, DS job link row counts auditing, database error/reject link processing, reading Excel files directly
2. Client: Deloitte – Oregon Dept. of Justice, Salem, OR (06/2017) – (04/2018)
ETL Developer (employer: Unica Group)
• Developed ETL using IBM Infosphere v11.5 Parallel in a Win 2012 environment to convert data from the DB2 Legacy Child Support database to the highly relational target DB2 database
• Used IBM Data Studio 4.1.2 for database querying and DB2 stored procs for validation
• Researched issues and corrected defects using strict data lineage analysis standards; worked with the Online App team to fix defects and online requirements
3. Martin Marietta, Raleigh, NC (04/2017) – (05/2017)
Senior ETL Developer (Full-time employee)
• Converted existing Hadoop environment queries to DataStage v11.3/.5 Parallel jobs and performed day-to-day maintenance of existing DataStage v11.3/.5 jobs and sequences
4. Client: USAA, San Antonio, TX (03/2017)
Consultant (employer: Apex Systems)
• Analyzed and documented existing DataStage v9.1 Parallel project jobs and sequences that extract cloud-hosted ServiceNow data into SQL Server collection tables using the Java connector stage and load it into a Netezza dimensional DW; performed fit-gap analysis of ServiceNow data and created a reverse-engineered data model in Erwin 9.5 for the ServiceNow reporting SQL Server database
5. Client: Bank of America, Pennington, NJ (NYC area) (10/2016) – (02/2017)
Datastage Specialist (employer: ADP TotalSource)
• Designed and developed IBM DataStage v9.1 Parallel ETL jobs and sequences to generate alert records for the Risk Management Supervision (RMS) system, with Linux 2.6.32 as the operating platform and Autosys as the scheduler, processing data from DB2 databases, web services, flat files, complex flat files, and XML files
• Wrote Linux scripts for job execution, record count reconciliation, and file archival/handling
6. Client: TIAA, Iselin, NJ (NYC area) (05/2016) – (09/2016)
Sr. Datastage Architect (employer: Mitchell Martin Inc.)
• Designed and developed IBM DataStage v11.5 Parallel processes to maintain a custom MDM environment comprising insurance and retirement plan information, with Linux 2.6.32 as the operating platform and Autosys as the scheduler, interacting with files, database tables, XML files, and Java APIs, with an Oracle 11g target database
7. Accenture, Arlington, VA - D.C. area (10/2013) – (04/2016)
Digital Integration Consultant (Full-time employee)
• Worked at TJX (05/2015 – 04/2016) in Marlborough, MA as the Lead DataStage Developer, leading a project end to end: designed, developed, and deployed IBM DataStage v8.7 Parallel ETL processes, executed by AIX scripts on an AIX platform, to convert and reconcile PO data between multiple systems connected to DB2, Oracle, SQL Server, and Netezza databases
• Worked at NiSource (06/2014 – 04/2015) Charleston, WV/ Columbus, OH/ Houston, TX to:
o Design and develop IBM DataStage Parallel v8.7 ETL on an AIX platform with SQL Server 2008 R2 and Oracle 10g databases
o Implement high-performing, flexible, and reusable DataStage ETL processes utilizing the Runtime Column Propagation (RCP), Change Capture (CC), and Checksum features, with PL/SQL scripts to perform data archival and shell scripts to execute jobs
o Maintain Cognos 10.2.1 Financials DW reports on the PeopleSoft EPM 9.1 platform
• Worked at Health Alliance Plan (10/2013 – 05/2014), Southfield, MI (Detroit) to develop and maintain highly complex ETL processes using DataStage Parallel v8.5 running on a Windows platform and PL/SQL scripts
8. Client: U.S. Dept of Education (ED), Washington, DC (07/2013) – (09/2013)
Datastage Developer (employer: ANR Consulting)
• Maintained highly complex ETL processes using DataStage Parallel v8.7 running on a Linux platform to extract Federal Student Loan data to populate a dimensionally modeled Oracle 11g data warehouse; wrote shell scripts to process, archive, and audit files
9. Client: Vanguard, King of Prussia, PA (06/2013) – (07/2013)
Datastage Developer (employer: Judge Group)
• Designed and built highly complex ETL processes using DataStage Parallel v8.7 running on a UNIX platform using Oracle databases and files
10. Client: Univ. of Penn Medicine, Philadelphia, PA (02/2013) – (06/2013)
Datastage Analyst (employer: CEI)
• Developed ETL using DataStage Parallel v8.7 on an Oracle and Linux platform with sources in various RDBMSs such as SQL Server, Oracle, DB2, and Informix to populate star schemas for clinical data marts designed using Sybase PowerDesigner 15.3
11. Client: Boeing, Ridley Park, PA (07/2009) – (12/2012)
Datastage Consultant (employer: Comforce)
• Developed ETL for Boeing’s Supplier DW on DataStage Server v8.7 following Kimball Methodology in an Oracle Change Data Capture (CDC) staging environment, integrating data from a dozen Boeing locations across the country spread across different DBMSs such as Oracle, SQL Server, and Teradata
• Improved performance of ETL processes by 25 – 35%
• Built and maintained tables, views, materialized views, PL/SQL procedures, and monitored/tuned database performance on Oracle 11g /10g; Developed shell scripts for Autosys scheduler jobs on AIX 5.3
12. Clarabridge Inc., Reston, VA (03/2009) – (07/2009)
Consultant (Full-time employee)
• Developed Cognos v8.4 reports for top clients such as BestBuy, Sage Software, AOL, McDonald's, GAP, and United Airlines
• Developed Clarabridge Mining Platform (CMP) processes and online web surveys on the Vovici EFM framework using complex JavaScript
• Designed and built PL/SQL scripts/anonymous blocks, Perl scripts, tables, views, materialized views, and stored procedures on Oracle 10g platform
13. NVR Inc., Reston, VA (07/2006) – (11/2008)
Data Warehouse Administrator (Full-time employee)
• Designed and built data warehouses on a variety of Financial, Mortgage, and Manufacturing data sources using DataStage Server v7.5.2 and T-SQL on dimensional and relational data models built with Erwin 7.1
• Pioneered real-time ETL, CRC, environment variables, and FTP improving the ETL processing time by 30 – 50%
• Provided Production Support on a 24/7 basis as the primary on-call support person and assisted in upgrades such as DataStage v7.5.2, SQL Server 2005, and Hyperion 8.3
14. Mahindra-Satyam (04/1996) – (07/2006)
Consultant (Full-time employee)
• Caterpillar Inc. in Peoria, IL (04/1999 – 07/2006) on various projects involving Cognos ReportNet platform, DataStage Server v7.5, Oracle 10g/9i, DB2, PeopleSoft HCM, PeopleSoft SQR reports
• Delta Airlines in Eagan, MN, and India (04/1996 – 04/1999) on various mainframe projects using COBOL DB2, IMS DB/DC, CICS, SAS