Available for AI engineering rolesCincinnati, Ohio

Akash Nallagonda.
AI Engineer.

I design and ship AI systems that connect language models to real data, real workflows, and real users, across Azure OpenAI, Azure AI Search, Model Context Protocol, FastAPI, Next.js, and cloud infrastructure.

View work Download resume GitHub LinkedIn

architecture / production rag

grounded · cited · low latencyazure ai foundry

46,000+

students and faculty served

Campus AI assistant Akash builds on at University of Cincinnati.

~$80,000

projected annual cost avoidance

TAMALE extraction gateway, running on a ~$13/month Azure footprint.

30%

lower response latency

Through retrieval and prompt router tuning on BearcatGPT.

20+

concurrent AI initiatives tracked

By leadership through the meeting intelligence platform.

200+

faculty and staff trained

On GenAI and Microsoft Copilot.

150+

students in responsible AI workshops

High school cohort at the RAISE AI Summit.

Best Security Hack

MakeUC 2024

Secure Sight, real time weapon detection platform.

IEEE ICCCI 2024

Published author

Real time student tracking and responsive design papers.

Featured AI systems

Production work, not demos

Four shipped systems chosen to show range: enterprise RAG, document AI middleware, an automation pipeline that replaces a vector database, and a full stack AI product.

View all projects

RAG and agentsProduction2025

BearcatGPT

Campus scale AI assistant and agent platform

Production generative AI assistant serving 46,000 plus students and faculty at University of Cincinnati. RAG pipelines on Azure AI Foundry with hybrid retrieval, Socratic tutoring agents, agentic routing, and MCP integrations to external enterprise systems.

Azure AI FoundryAzure OpenAIAzure AI SearchRAGMCPMulti agent

46,000+

Users

30%

Latency reduction

Read case study

live

RAG and agentsProduction2025

TAMALE AI Extraction Gateway

Enterprise RMS integration on a $13 per month footprint

Stateless FastAPI middleware that bridges Advent TAMALE and BearcatGPT. In memory bitstream extraction turns binary research attachments (PDF, DOCX, XLSX, PPTX) into structured JSON envelopes for LLM tool calls, hardened with API key auth, Key Vault secrets, SHA256 hashing, and standards compliant error codes.

FastAPIAzure App ServiceAzure Key VaultPyMuPDFpython docxopenpyxl

~$80,000

Projected annual cost avoidance

~$13/mo

Azure footprint

Read case study

AutomationProduction2025

AI Meeting Intelligence Platform

Executive use case tracker over Microsoft Teams transcripts

Five layer enterprise workflow that ingests Teams .vtt transcripts, generates strictly grounded JSON via Azure AI Builder, persists to a flat SharePoint list, and exposes the corpus to BearcatGPT through Microsoft Graph. Deliberately skipped a vector database in favor of a per person and date range access pattern.

Power AutomateAzure AI BuilderMicrosoft GraphSharePoint OnlineBearcatGPT

20+

AI initiatives tracked

Vector DB cost

Read case study

Full stack AIShipped2025

LectureMind

Full stack generative AI study platform

Generative AI learning platform that turns long form lectures into grounded outlines, summaries, flashcards, quizzes, mind maps, and faculty accessibility reports. Multi model orchestration with fast and reasoning paths, separate student and faculty workspaces, and a transient faculty session mode with cleanup on signout.

Next.jsTypeScriptPrismaPostgreSQL (Neon)Azure OpenAIAzure AI Search

Cloudforce No Resume Required

Hackathon

Fast + reasoning

Models routed

Read case study

View all projects

What I build

Four shapes of AI work I ship

Different stacks, same engineering bar. Each of these maps to a project on this site.

RAG and retrieval systems

Hybrid search over enterprise corpora with chunking, embeddings, BM25, and re ranking. Grounded answers with citations and refusal guardrails.

Azure AI SearchAzure OpenAItext embedding 3 large

LLM agents and tool use

Multi agent routing, controlled handoffs, tool calling with structured outputs, and Model Context Protocol servers connecting agents to real systems.

Azure AI FoundryMCPTool calling

Automation and workflow pipelines

Production automations that move work between SaaS, Microsoft 365, and LLMs on a schedule or trigger, with logging and human readable outputs.

Power AutomateMake.comMicrosoft Graph

Full stack AI products and applied ML

Next.js front ends, FastAPI workers, Postgres, and Azure infrastructure delivered end to end, with classical ML and computer vision where they fit.

Next.jsFastAPIPrismaAzure

Proof of work

Published, presented, recognized

Conference papers, leadership talks, hackathon wins, and education work that backs up the engineering.

Full proof page

Publication2024IEEE ICCCI 2024
An AI Based Student Tracking System to Analyze Student Behavior
Real time classroom behavior analysis and automated attendance using YOLOv8 and face recognition.
Publication2024IEEE ICCCI 2024
Responsive Design using HTML and Tailwind CSS
Adaptive layouts, accessibility, and cross device user experience.
Talk2026DTS AI Symposium
Transforming Learning and Teaching Assistance with BearcatGPT AI Agents
Presented multi agent tutoring workflows and retrieval design to university leadership and faculty.
Recognition2024MakeUC 2024
Best Security Hack
Awarded for Secure Sight, the real time weapon detection platform.
Source
Workshop2025RAISE AI Summit
Responsible AI workshops
Guided 150 plus high school students through responsible AI workshops.
Workshop2025University of Cincinnati
GenAI and Copilot enablement
Trained 200 plus faculty and staff on generative AI and Microsoft Copilot.

Technical stack

The stack I actually ship with

Grouped by what they do, not by alphabetical order. Production tools first. Things I am still exploring are clearly labeled.

AI and LLM systems

Production retrieval, agents, grounding, and prompt design across Azure AI and frontier models.

Azure OpenAIAzure AI FoundryAzure AI Search (Cognitive Search)RAG pipelinesVector embeddings (text embedding 3 large)BM25 hybrid retrievalChunking and relevance re rankingMulti agent systems and agentic routingPrompt engineeringTool calling and structured outputsModel Context Protocol (MCP) serversGrounding and refusal guardrailsAnthropic Claude APIMistral OCRAzure Speech (ASR)

Orchestration and automation

No code and low code pipelines that move real work between systems on a schedule or trigger.

Power AutomateAzure AI BuilderMicrosoft Copilot StudioMake.comMicrosoft Graph APITwilio API

Backend engineering

Python and Node services that put guardrails, auth, and clean contracts in front of models and tools.

FastAPIPydanticREST APIsAsync I/ONode.jsPrismaMicroservices

Frontend engineering

Type safe, accessible, recruiter ready interfaces for AI products and dashboards.

Next.js (App Router)ReactTypeScriptTailwind CSSshadcn/uiResponsive UI

Data and ML

Classical ML, computer vision, and document extraction work that supports the AI stack.

PandasNumPyscikit learnXGBoostTensorFlowYOLO (v8, v11)OpenCVPyMuPDF / pypdfpython docxopenpyxlpython pptxPapaParseFeature engineering and model evaluation

Cloud and DevOps

Azure first, with AWS and Vercel for shipping, plus the boring parts that keep things alive in production.

Azure App ServiceAzure Container AppsAzure FunctionsAzure Blob StorageAzure Key VaultApplication Insights / Azure MonitorMicrosoft Graph APISharePoint OnlineAWS (EC2, S3, Lambda, SageMaker)VercelDockerKubernetesGitHub Actions and CI/CDTerraform