Portfolio Website

Sameer Enjapuri

I'm

I am a detail-oriented and results-driven professional with 5 years of expertise in Data Engineering and Analytics. I am proficient in designing, building, and optimizing data pipelines for efficient extraction, transformation, and loading (ETL) processes. I excel in interpersonal skills and am comfortable working with stakeholders across disciplines, propelled by an unwavering dedication to seamless data integration.

Sameer Enjapuri
Python SQL ETL Kafka Tableau PowerBI Informatica AWS HTML Data Warehouse CSS VS Code Python SQL ETL Kafka Tableau PowerBI Informatica AWS HTML Data Warehouse CSS VS Code

About Me Skills

Languages

  • Python (Pandas, NumPy, Scikit Learn)
  • R
  • C
  • SQL
  • HTML
  • CSS

Databases

  • Oracle
  • MySQL
  • PostgreSQL
  • MongoDB
  • MS SQL

Data Warehousing

  • ETL (Extract, Transform, Load)
  • MS SQL Server
  • Apache Kafka
  • Confluent Kafka
  • Informatica PowerCenter

Data Visualization

  • Tableau
  • MS Power BI
  • MS Excel
  • Qlik Sense
  • Looker Studio
  • Amazon QuickSight

Data Analytics

  • Exploratory Data Analysis
  • Data Cleaning
  • Data Visualization
  • Data Modeling
  • Statistical Analysis

Cloud Technologies

  • AWS (Amazon S3, AWS Glue, AWS Lambda, AWS Athena)

A bit about me

Hi, I'm Sameer Enjapuri, a detail-oriented and results-driven data engineer and analyst with over 5 years of experience in designing, building, and optimizing data pipelines. My expertise lies in ETL processes, data integration, and visualization, which I've honed through various roles and projects.

I'm currently pursuing a Master of Science in Business Analytics and Information Systems at the University of South Florida, where I'm expected to graduate in May 2025 with a CGPA of 3.87/4. I also hold a Bachelor of Engineering in Electronics Engineering from Shah & Anchor Kutchhi Engineering College.

My technical skill set includes Python, R, C, SQL, HTML, CSS, and various databases like Oracle, MySQL, PostgreSQL, and MongoDB. I am proficient with tools such as GitHub, Jenkins, Docker, and AWS, and I excel in data visualization using Tableau, MS Power BI, and Looker Studio. Additionally, I have extensive experience with ETL pipelines and Kafka.

I'm currently working on a project at the University of South Florida where I lead the development of comprehensive reports using Tableau to analyze economic performance in the Tampa Bay area. In my previous role at Capgemini, I developed real-time data integration solutions using Confluent Kafka and Python, significantly enhancing data streaming efficiency and decision-making processes for a banking client.

I'm passionate about leveraging data to drive strategic decisions and improve operational efficiency. Whether it's through building advanced ETL pipelines, optimizing database performance, or creating insightful visualizations, I strive to make data accessible and actionable for stakeholders across disciplines.

Experienced in collaborating with cross-functional teams, including data scientists, business analysts, and software engineers, to achieve organizational goals. Effectively communicates technical concepts to non-technical stakeholders, ensuring alignment and understanding. Eager to contribute to projects that utilize data to improve people's lives and make a positive impact on society.

Available for internship and full-time roles, including remote positions.

Recent Work Experience

Workplace 1 - University of South Florida
Workplace - University of South Florida

Graduate Research Assistant

Jan 2024 - Present

Contributed to the Tampa Bay E-Insights report, which examines the economic performance of the Tampa Bay region relative to 19 comparable Metropolitan Statistical Areas (MSAs). Developed comprehensive Tableau reports that analyze key areas such as economic outcomes, affordability, and the talent pipeline. These efforts provided critical insights for policy recommendations.

Workplace 2 - Capgemini
Workplace - Capgemini

Data Engineer/Analyst


Feb 2022 - Jul 2023

Analyzed data streams, reducing processing time by 40%. Engineered real-time data integration solutions with Confluent Kafka, improving streaming efficiency by 25%. Designed interactive Tableau dashboards for predictive analytics, enhancing decision-making by 30%.

Workplace 3 - Tata Consultancy Services
Workplace - Tata Consultancy Services

Data Engineer/Analyst


Sep 2018 - Jan 2022

Managed ETL pipelines in Informatica PowerCenter, achieving a 95% success rate in transitioning workflows to Kafka. Optimized database performance by 15% using advanced data modeling techniques. Applied business intelligence principles for accurate reporting and insights.

Academic Projects

Data Engineering YouTube Analysis

AWS, ETL, S3, Glue, Lambda, Athena, QuickSight

Developed a scalable data pipeline on AWS integrating services like S3, Glue, Lambda, and Athena for ingestion, ETL, and querying. Established a centralized data lake for storage and analysis of YouTube video data. Created a Dashboard using Amazon QuickSight for reporting and extracting insights from video metrics.

Analyzing Spotify Songs Dataset

Databricks, PySpark, ML, K-Means, Decision Tree Classifier

Performed analysis, visualization, and implemented ML models on a Spotify dataset stored in a distributed file system. Employed PySpark for data processing, querying, and building ML pipeline on the Databricks platform. Used K-Means clustering to cluster songs into similar playlists and a Decision Tree Classifier to classify songs into genres.

Hospital Management Advanced Database Management Project

SQL, SSMS, Azure

Designed ERD, EERD, RS and created a hospital management database with 14 tables in SQL SSMS. Created Stored procedures, Views, Triggers, UDFs to implement business logic and deployed the database to an Azure remote server.

Contact

My Address

14308 Wedgewood Ct

Tampa, FL 33613

Email

enjapurisameer@gmail.com

senjapuri@usf.edu

Contact

+1 813-389-8792