Available for opportunities

Lakshay Hasija

Data Analyst → Data Scientist

Building production-grade data pipelines, LLM-powered tools, and ML systems that turn raw data into decisions. 2.5+ years turning complexity into clarity.

2.5+ Years Experience
3 Production Projects
90% Automation Coverage
40% Reporting Time Saved
01

About Me

I'm a Data Analyst at Midtown Software with 2.5+ years of experience transforming messy data into decisions that matter. My work spans SQL pipelines, Python automation, Power BI dashboards, and, increasingly, machine learning.

What drives me is the gap between data and action. I build systems that close that gap automatically — whether that's an NLP pipeline that classifies 10,000 notices or a real-time chatbot that surfaces database insights on demand.

I'm actively growing toward Data Science — building projects that involve LLMs, ML deployment, statistical experimentation, and drift monitoring. Not just notebooks — production systems.

B.Voc in Software Development from Ramanujan College, Delhi University (8.5 CGPA, 2023).

Automation First
Reduced manual reporting by 40% through Python + Power BI automation frameworks at Midtown Software.
🤖
LLM + Data
Built a production SQL Agent powered by Gemini + LangChain — plain English to SQL to business insight.
📊
Stats-Driven Decisions
Built an A/B testing framework covering frequentist tests, Bayesian inference, and power analysis, going beyond dashboards.
🔁
Production ML
End-to-end ML pipeline with FastAPI deployment, MLflow tracking, and real-time drift detection.
02

Skills

Languages & Tools
Python, SQL, LangChain, FastAPI, Streamlit, Git, Azure, Jenkins, Selenium, Pandas, NumPy
Machine Learning
Scikit-learn, Random Forest, MLflow, Drift Detection, NLP, Feature Engineering, Model Deployment, GenAI
Statistics & Analysis
A/B Testing, Hypothesis Testing, Bayesian Analysis, Power Analysis, Confidence Intervals, EDA, ETL
Data Visualization
Power BI, Plotly, Matplotlib, Seaborn, Tableau
Databases
MySQL, MS SQL Server, SQLite, Cosmos DB, SQLAlchemy
03

Projects

🤖
LLM-Powered SQL Agent
Natural language to SQL pipeline using Google Gemini + LangChain. Ask plain English questions — get SQL, results, and a business summary instantly. No SQL knowledge needed.
▸ Plain English → SQL → Business Insight
Python, LangChain, Gemini API, SQLite, Streamlit
🧪
A/B Testing Framework
Statistical experimentation framework built from scratch. Implements frequentist Z-tests, Bayesian Beta-Binomial models, and power analysis, with interactive visualizations.
▸ Frequentist + Bayesian + Power Analysis
Python, SciPy, Plotly, Streamlit
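
As a rough illustration of the frequentist and Bayesian pieces named above, the sketch below runs a pooled two-proportion Z-test and a Beta-Binomial posterior comparison. The function names, the uniform Beta(1, 1) prior, and the sample counts in the usage note are assumptions made for illustration, not the project's actual code.

```python
import numpy as np
from scipy import stats


def two_proportion_ztest(conv_a, n_a, conv_b, n_b):
    """Two-sided Z-test for a difference in conversion rates (pooled variance)."""
    p_a, p_b = conv_a / n_a, conv_b / n_b
    p_pool = (conv_a + conv_b) / (n_a + n_b)
    se = np.sqrt(p_pool * (1 - p_pool) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    p_value = 2 * stats.norm.sf(abs(z))  # two-sided tail probability
    return float(z), float(p_value)


def prob_b_beats_a(conv_a, n_a, conv_b, n_b, draws=100_000, seed=0):
    """Monte Carlo estimate of P(rate_B > rate_A), assuming Beta(1, 1) priors."""
    rng = np.random.default_rng(seed)
    # Beta-Binomial conjugacy: posterior is Beta(1 + successes, 1 + failures).
    post_a = rng.beta(1 + conv_a, 1 + n_a - conv_a, size=draws)
    post_b = rng.beta(1 + conv_b, 1 + n_b - conv_b, size=draws)
    return float(np.mean(post_b > post_a))
```

For example, `two_proportion_ztest(100, 1000, 130, 1000)` compares a hypothetical 10% control rate against a 13% variant rate, and `prob_b_beats_a` with the same counts gives the posterior probability that the variant is better.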
🔁
ML Pipeline + Drift Monitoring
End-to-end churn prediction pipeline — model training, FastAPI deployment, MLflow experiment tracking, and KS-test drift detection with automated retraining alerts.
▸ ROC-AUC 0.785 | REST API | Drift Detection
Scikit-learn, FastAPI, MLflow, Streamlit
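
The KS-test drift check mentioned above could look something like the following minimal sketch, which compares one feature's training-time values against recent values using SciPy's two-sample Kolmogorov-Smirnov test. The `detect_drift` helper and the 0.05 significance threshold are hypothetical choices for illustration, not the pipeline's actual implementation.

```python
import numpy as np
from scipy.stats import ks_2samp


def detect_drift(reference, current, alpha=0.05):
    """Flag drift when the KS test rejects 'both samples share a distribution'."""
    result = ks_2samp(reference, current)
    return {
        "statistic": float(result.statistic),
        "p_value": float(result.pvalue),
        "drifted": bool(result.pvalue < alpha),  # alpha is an assumed threshold
    }


# Usage with synthetic data: a one-standard-deviation mean shift should be flagged.
rng = np.random.default_rng(0)
reference = rng.normal(loc=0.0, scale=1.0, size=2000)
shifted = rng.normal(loc=1.0, scale=1.0, size=2000)
report = detect_drift(reference, shifted)
```

In a deployed setting a check like this would typically run per feature on a schedule, with a `drifted` result feeding the retraining alert.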
⚙️
Azure CI/CD & RPA Pipeline
Python-based RPA bot with a Jenkins CI/CD pipeline for automated data extraction. Uploads to Cosmos DB through API endpoints are fault-tolerant: extracted data is held in intermediate storage, so a failed upload does not force a re-scrape.
▸ Fault-tolerant | Zero data loss architecture
Python, Azure, Jenkins, Cosmos DB, RPA
04

Experience

Midtown Software
Data Analyst
Mohali, Punjab
Sep 2023 — Present
2.5+ years
  • Team Leadership: Manage and mentor a team of 4, guiding cross-functional collaboration to design data-driven decision frameworks and automate reporting workflows.
  • ML & NLP: Automated notice classification pipelines using Python and NLP models, achieving 90% automation coverage on unstructured text data.
  • Business Intelligence: Engineered Power BI dashboards that improved data visibility by 25% and reduced manual reporting effort by 40%.
  • Database Optimization: Executed complex SQL-based ETL workflows and optimized queries to streamline data integration across multiple sources.

Let's Connect

Open to Data Analyst and Data Science roles. If you're building something interesting with data — let's talk.