Work Experience

Walmart Inc.

Data Engineer III, Dallas, TX

May 2022 - Aug 2022

Worked on an initiative to generate real-time reports on Walmart+ customer data, focusing on store signups. Built near-real-time data streaming pipelines using Apache Spark and Apache Kafka to stream petabytes of click data into Google Cloud Storage. Developed reports and interactive dashboards in Looker that gave the marketing and operations teams actionable insights for decision-making. The project improved data accessibility and reporting efficiency, allowing timely analysis of and response to customer behavior trends.


Network Science Institute - Lazer Lab

Data Scientist, Boston, MA

Nov 2021 - May 2023

Implemented and monitored scalable data pipelines to identify and characterize social behavior from social media data, primarily from Twitter. Developed and refined machine learning models to classify tweets as misinformation or disinformation. Created misinformation dashboards for various events, providing real-time insights and visualizations. This work contributed to understanding the spread of false information on social media platforms. Collaborated with cross-functional teams to ensure the robustness and accuracy of the pipelines and models.


Fair Isaac Corporation (FICO)

Software Development Intern, Bangalore, India

Jan 2021 - Jun 2021

Implemented and integrated a regular expression validation feature for a Mortgage Decision Management Platform tool, improving data accuracy and validation efficiency. Developed and deployed multiple APIs to convert data into various formats for storage across different database systems. Increased the project's code coverage by over 11% through comprehensive regression testing, improving the reliability and maintainability of the codebase.


Antriksh Labs Pvt. Ltd.

Research Intern, Mysore, India

Jan 2020 - Dec 2020

Initiated methodological research to identify optimal visualization techniques for both structured and unstructured data. Conducted qualitative research and developed intelligent data pipelines for an AutoML SaaS product. Leveraged distributed systems such as Spark and Hadoop to accelerate data processing and statistical analysis, improving the efficiency of data workflows. This work contributed to advancing the capabilities of the AutoML platform and improving its user experience.