Data Engineer I
July 2025 — CurrentAt Albertsons Companies, Inc
Hi, I’m Prajwal — born and raised in Bangalore and now building my career in the world of data and AI. I have recently earned my Master’s in Business Analytics & Artificial Intelligence at UT Dallas.
1️⃣ Right now, I work at Albertsons as a Data Engineer, where I build data pipelines and tools that make financial processes like Profit & Loss allocation more efficient and reliable. Before this, I worked at GenerativeProduct.io, where I explored the practical side of Generative AI, from RAG systems to agentic AI. I enjoy working at the intersection of data engineering and applied AI — solving problems end to end, from moving raw data to delivering something that creates impact.
2️⃣
2 – Outside of work (hobbies & interests):
Outside of work, I’ve recently gotten back into cycling, aiming for 8–10 miles per ride as I rebuild consistency. I’m also planning to try candle-making soon as a creative side hobby. I love playing FIFA, watching Manchester United, and mixing football stats with my love for the game. When I’m not on the bike or gaming, I’m usually catching up on tech trends, news, and current affairs, or just exploring new hobbies that let me recharge and stay curious.
Contact me at prajwalkp.work@gmail.com
I'm currently diving deeper into generative AI and large language models, exploring how they can transform business intelligence and analytics.
Exploring Microsoft Synapse Analytics and Azure services to build scalable data solutions, optimize analytics workflows, and enhance cloud-based data processing.
Languages
Python, R SQL and VectorDB - FAISS, Pinecone SAS Bash/Shell ScriptingTools
Data Viz - Tableau, Looker and PowerBI Databricks Snowflake Docker & Kubernetes Git, GitLab and Azure Repos Jira & Confluence Apache Kafka, Airflow, Spark, dbt HadoopCloud
Azure: Synapse Analytics, DevOps AWS: EC2, S3, QuickSightMachine Learning and AI
Scikit-Learn Tensorflow PyTorch LangChain & LLamaIndexLibraries
Numpy, Pandas, Matplotlib Plotly & Seaborn Flask & StreamlitBuilt an automated ETL pipeline using Apache Airflow to extract book data directly from the Amazon Bookstore, transform it for consistency, and load it into a PostgreSQL database. This enabled structured analysis of titles, pricing, ratings, and availability for downstream insights.
CometVerse is a voice-enabled AI chatbot built for UT Dallas, designed to handle student queries using LLMs and NLP. It supports conversation history, voice input, and PDF export, offering a smarter, more interactive alternative to traditional university chatbots.
An AI-powered tool designed to help candidates prepare for Product Management interviews by providing automated real-time feedback. It leverages Speech-to-Text, Retrieval-Augmented Generation (RAG), and LLMs to enhance interview readiness and improve responses.
Developed a predictive model using ML techniques (Linear/Polynomial Regression, SVR, Decision Tree, Random Forest, etc.) to forecast ride demand. The project supports smarter driver allocation, pricing strategies, and overall efficiency in ride-hailing services.
To build this, we will make usage of a popular distributed streaming platform Apache Kafka, to produce, consume and stream our data into BigQuery (data warehouse) so we can do some analysis using SQL and connect it to PowerBI where we are going to visualize on a dashboard.
The goal was to understand player injuries based on specific variables related to plays & game conditions. Apart from basic ML models, traditional survival analysis models were also explored - Cox Proportional Hazard model. Feature Importance using Mean Decrease in Impurity (MDI) was obtained.
Presented important insights and built a model for the prediction of real estate prices in the city of Bengaluru. Got an accuracy score of 0.68 for the model used. Comparison of other models - Lasso and Decision Trees was also explored.