Available for full-time roles

Madhavi Akella

Data & AI Engineer  ·  Generative AI  ·  AWS AI/ML  ·  Cloud Data Platforms

5+ years of enterprise data engineering experience at Accenture, complemented by 4+ years of independent data engineering consulting and applied AI/ML development. Certified Databricks Generative AI Engineer who builds and deploys live RAG and LLM applications. Based in Cincinnati, Ohio, and ready to contribute immediately.

Certifications
🏅 Databricks GenAI Engineer
🏅 Azure AZ-900
🏅 Snowflake
🏅 IBM Data Science
⏳ AWS AI Practitioner
5+ years at Accenture
4+ years of independent consulting
65% ETL performance improvement
What I Build With

Skills & Technologies

Generative AI & LLMs
RAG Pipelines · LangChain · Prompt Engineering · OpenAI · Amazon Bedrock · Hugging Face · FAISS · Pinecone · Chroma
AWS AI/ML Services
Amazon SageMaker · Bedrock · Rekognition · Comprehend · Lex · Glue · Lambda · S3 · Redshift
Data Engineering
ETL/ELT · Informatica PowerCenter · IICS · Star Schema · SCD Type II · Medallion Architecture · Lakehouse
Cloud Platforms
AWS · Microsoft Azure · Azure Databricks · ADF · ADLS Gen2 · Azure Event Hubs
Big Data & Streaming
Databricks · Apache Spark · PySpark · Delta Lake · Structured Streaming · Snowflake
Programming & BI
Python · SQL · Shell Scripting · Power BI · Tableau · Alteryx · Streamlit
Live Deployments

AI Portfolio

🤖
Feb 2026
LLM Document Q&A — RAG Chatbot
Upload any PDF and ask questions in plain English. Retrieval-Augmented Generation pipeline powered by LangChain, OpenAI, and FAISS. Deployed on Streamlit Cloud.
RAG LangChain FAISS OpenAI Streamlit
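The retrieval step of a RAG pipeline like this one can be illustrated with a minimal, dependency-free sketch. It is not the deployed app's code: word-count vectors stand in for OpenAI embeddings, a sorted list stands in for the FAISS index, and the function names and sample chunks are hypothetical.

```python
import math
import re
from collections import Counter

def embed(text):
    # Stand-in for a real embedding model: a bag-of-words count vector.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(question, chunks, k=2):
    # Rank document chunks by similarity to the question and keep the top k.
    q = embed(question)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]

def build_prompt(question, context_chunks):
    # The retrieved chunks become the grounding context sent to the LLM.
    context = "\n".join(f"- {c}" for c in context_chunks)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"

# Hypothetical chunks, as if split from an uploaded PDF.
chunks = [
    "The warranty covers parts and labor for two years.",
    "Returns are accepted within 30 days of purchase.",
    "Shipping takes five business days.",
]
question = "How long is the warranty?"
top = retrieve(question, chunks, k=1)
prompt = build_prompt(question, top)
```

In a production pipeline the prompt would then be passed to a language model; the sketch stops before that call so it runs without API keys.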
🎯
Mar 2026
AI Resume Job Matcher
Semantic matching system that scores your resume against any job description using Sentence Transformers and LLM-powered skill gap analysis with actionable suggestions.
NLP Sentence Transformers GPT-4 Streamlit
Nov 2025 – Jan 2026
AI Coffee Demand Predictor
ML forecasting model integrating weather and event data. Improved accuracy from 60% to 90%, reducing waste by 50% and stockouts by 75%. Demonstrated $12K+ annual savings per store.
scikit-learn Python Streamlit UC Berkeley
🏗️
Aug 2024 – Jun 2025
Azure Retail Data Lakehouse
Full Medallion Architecture (Bronze/Silver/Gold) on Azure using ADLS Gen2, Azure Data Factory, and Databricks. PySpark ETL with partitioning and caching optimizations.
Azure Databricks PySpark Delta Lake ADF
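The Bronze/Silver/Gold layering can be shown in miniature with plain Python lists standing in for Delta tables. This is a conceptual sketch only: the field names (`order_id`, `store`, `amount`) and sample rows are hypothetical, and a real lakehouse would run the same steps as PySpark transformations.

```python
from collections import defaultdict

# Bronze: raw records exactly as ingested, including duplicates and bad rows.
bronze = [
    {"order_id": "1", "store": "CIN", "amount": "20.50"},
    {"order_id": "1", "store": "CIN", "amount": "20.50"},  # duplicate
    {"order_id": "2", "store": "CIN", "amount": "abc"},    # unparseable amount
    {"order_id": "3", "store": "DAY", "amount": "10.00"},
]

def to_silver(rows):
    """Deduplicate on order_id and cast types, dropping rows that fail to parse."""
    seen, clean = set(), []
    for row in rows:
        if row["order_id"] in seen:
            continue
        try:
            amount = float(row["amount"])
        except ValueError:
            continue
        seen.add(row["order_id"])
        clean.append(
            {"order_id": int(row["order_id"]), "store": row["store"], "amount": amount}
        )
    return clean

def to_gold(rows):
    """Aggregate cleaned rows into a business-level table: revenue per store."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["store"]] += row["amount"]
    return dict(totals)

silver = to_silver(bronze)
gold = to_gold(silver)
```

Keeping the raw Bronze data untouched means the Silver and Gold layers can be rebuilt at any time if a cleaning rule changes.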
🔍
Apr 2026
AWS-Style Sentiment & NLP Analyzer
Replicates core Amazon Comprehend features: sentiment analysis, key phrase extraction, and entity detection, with AWS-style JSON output. Supports single and batch analysis modes.
NLP AWS Comprehend Sentiment Analysis Streamlit
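As a rough illustration of AWS-style output, the sketch below returns results shaped loosely like Amazon Comprehend's sentiment response (`Sentiment` plus a `SentimentScore` object). The scoring itself is an assumption for demonstration: a tiny hypothetical word lexicon, not the analyzer's actual model and not how Comprehend computes scores.

```python
import json
import re

# Hypothetical toy lexicons; a real analyzer would use a trained model.
POSITIVE_WORDS = {"great", "good", "love", "excellent", "fast"}
NEGATIVE_WORDS = {"bad", "slow", "poor", "hate", "broken"}

def detect_sentiment(text):
    """Single-document mode: label the text and report per-class scores."""
    words = re.findall(r"[a-z']+", text.lower())
    pos = sum(w in POSITIVE_WORDS for w in words)
    neg = sum(w in NEGATIVE_WORDS for w in words)
    hits = pos + neg
    if hits == 0:
        label = "NEUTRAL"
    elif pos == neg:
        label = "MIXED"
    else:
        label = "POSITIVE" if pos > neg else "NEGATIVE"
    return {
        "Sentiment": label,
        "SentimentScore": {
            "Positive": pos / hits if hits else 0.0,
            "Negative": neg / hits if hits else 0.0,
            "Neutral": 0.0 if hits else 1.0,
            "Mixed": 0.0,  # the toy scorer does not estimate a mixed score
        },
    }

def batch_detect_sentiment(texts):
    """Batch mode: one result per input, tagged with its position."""
    return [{"Index": i, **detect_sentiment(t)} for i, t in enumerate(texts)]

results = batch_detect_sentiment([
    "Great service and fast delivery",
    "The app is slow and broken",
    "The package arrived on Tuesday",
])
print(json.dumps(results[0], indent=2))
```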
👥
Apr 2026
Employee Attrition Risk Predictor
ML classifier predicting employee attrition risk with 85%+ accuracy. Models 8 risk factors, generates HR recommendations, and projects $900K+ annual savings for 500-person companies.
Gradient Boosting scikit-learn HR Analytics Streamlit
Feb – Sep 2023
Real-Time Streaming Pipeline
Built real-time data ingestion pipelines using Azure Event Hubs and Databricks Structured Streaming. Processed high-volume data with PySpark and stored it in Delta Lake, with checkpointing for fault tolerance, to feed near-real-time dashboards.
Azure Event Hubs Structured Streaming Delta Lake PySpark
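The idea behind checkpointing can be sketched without Spark: record how far the stream has been processed so that a restarted job resumes from that point instead of starting over. This is a conceptual analogy under stated assumptions, not the pipeline's code. Structured Streaming manages its own checkpoint directory, and the file name, helper functions, and in-memory event list here are hypothetical.

```python
import json
import os
import tempfile

def load_offset(path):
    # Return the last committed offset, or 0 on the first run.
    if os.path.exists(path):
        with open(path) as f:
            return json.load(f)["offset"]
    return 0

def save_offset(path, offset):
    # Write to a temp file, then rename, so a crash never leaves a partial checkpoint.
    tmp = path + ".tmp"
    with open(tmp, "w") as f:
        json.dump({"offset": offset}, f)
    os.replace(tmp, path)

def run_batch(events, sink, checkpoint_path, batch_size=2):
    # Process the next micro-batch, then commit the new offset.
    start = load_offset(checkpoint_path)
    batch = events[start:start + batch_size]
    sink.extend(batch)
    save_offset(checkpoint_path, start + len(batch))
    return len(batch)

events = ["e0", "e1", "e2", "e3", "e4"]
sink = []
checkpoint = os.path.join(tempfile.mkdtemp(), "offset.json")

run_batch(events, sink, checkpoint)  # first run handles e0 and e1
run_batch(events, sink, checkpoint)  # a "restarted" run resumes at e2
```

Because the sink write happens before the offset is committed, a crash between the two steps would replay that batch on restart, so downstream writes in a design like this need to tolerate duplicates.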
🔄
May – Dec 2022
ETL Migration — Informatica to Databricks
Migrated legacy Informatica PowerCenter ETL workflows to Azure Databricks and ADF. Converted complex mappings to optimized PySpark transformations with full data validation, reconciliation, and ADF orchestration for scheduling and monitoring.
Informatica PowerCenter Azure Databricks ADF PySpark
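One common way to reconcile a migrated table against its legacy source is to compare key sets and per-row hashes. The sketch below shows that general technique in plain Python; it is an assumption about what such a check can look like, not the validation framework used in the migration, and the `reconcile` helper and sample rows are hypothetical.

```python
import hashlib

def row_hash(row, columns):
    # Hash the compared columns in a fixed order so equal rows hash equally.
    payload = "|".join(str(row[c]) for c in columns)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()

def reconcile(source_rows, target_rows, key, columns):
    """Compare two row sets by key; counts are of distinct keys."""
    src = {r[key]: row_hash(r, columns) for r in source_rows}
    tgt = {r[key]: row_hash(r, columns) for r in target_rows}
    shared = src.keys() & tgt.keys()
    return {
        "source_count": len(src),
        "target_count": len(tgt),
        "missing_in_target": sorted(src.keys() - tgt.keys()),
        "unexpected_in_target": sorted(tgt.keys() - src.keys()),
        "mismatched": sorted(k for k in shared if src[k] != tgt[k]),
    }

# Hypothetical extracts from the legacy and migrated pipelines.
legacy = [
    {"id": 1, "amount": 10},
    {"id": 2, "amount": 20},
    {"id": 3, "amount": 30},
]
migrated = [
    {"id": 1, "amount": 10},
    {"id": 2, "amount": 25},
]
report = reconcile(legacy, migrated, key="id", columns=["id", "amount"])
```

A report with empty `missing_in_target`, `unexpected_in_target`, and `mismatched` lists indicates that both pipelines produced the same rows for the compared columns.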
Where I've Worked

Professional Experience

Jul 2014 – Apr 2019
Informatica Developer
Accenture Services Pvt. Ltd · Hyderabad, India
2019 – Present
Independent Projects & Continuous Upskilling
Self-directed · Cincinnati, Ohio
Let's Connect

Ready to Contribute

Fully available for Data & AI Engineer roles. Based in Cincinnati, OH — open to remote and hybrid positions.