Skip to content
View haswanth13901's full-sized avatar

Block or report haswanth13901

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
haswanth13901/README.md

Hi πŸ‘‹, I'm Haswanth Chowdary Tummala

AI Engineer Β· GenAI & RAG Systems Β· Agentic Workflows Β· Scalable APIs & MLOps


πŸ’« About Me

πŸ”­ Currently: AI Engineer at Veritex Bank (Dallas, TX) β€” building production agentic AI platforms with LangChain, LangGraph, and Semantic Kernel, RAG/GraphRAG pipelines, LLM inference endpoints via AWS Bedrock & SageMaker, and FastAPI microservices powering high-throughput financial AI workflows.

🀝 Looking to collaborate on: Real-world AI systems with meaningful production constraints β€” agentic workflows, RAG/GraphRAG architectures, LLM fine-tuning (LoRA/PEFT), and MLOps pipelines. If it ships to prod and handles failure gracefully, I'm interested.

🌱 Currently deepening: Advanced multi-agent orchestration with LangGraph, GraphRAG with Neo4j, and cross-cloud model hosting strategies across AWS Bedrock, Azure AI Services, and GCP.

πŸ’¬ Ask me about:

  • Agentic AI platforms: multi-agent workflows, MCP servers, tool schema design, LLM gateways
  • RAG & GraphRAG pipelines: FAISS, ChromaDB, Pinecone, Weaviate, Neo4j knowledge graphs
  • LLM deployment & cost optimization: AWS Bedrock/SageMaker, LoRA/PEFT fine-tuning, Redis prompt caching
  • FastAPI microservices: async patterns, JWT/RBAC, Celery, webhook integrations
  • Full-stack AI: React/Next.js dashboards wired to ML backends, PySpark batch inference

⚑ Fun fact: I've shipped AI systems in banking and healthcare β€” domains where model failures have real consequences. That keeps the bar high and the excuses low.


πŸ› οΈ Tech Stack

πŸ€– AI / GenAI

LangChain LangGraph LlamaIndex Semantic Kernel OpenAI HuggingFace PyTorch TensorFlow scikit-learn

πŸ—„οΈ Vector DBs & Knowledge Graphs

FAISS ChromaDB Pinecone Weaviate Neo4j

πŸ’» Languages

Python TypeScript JavaScript SQL Java

βš™οΈ Backend & APIs

FastAPI Django DjangoREST NodeJS Redis Celery JWT

🎨 Frontend

React Next JS Redux Angular TailwindCSS

☁️ Cloud & DevOps

AWS Azure Google Cloud Docker Kubernetes GitHub Actions

πŸ“Š Data & Observability

Apache Spark Apache Kafka Pandas Power Bi Postgres OpenTelemetry Datadog


πŸ† GitHub Trophies


πŸ“ Publication

Ramya, P., Tummala, H. C., et al. (2023). Number Plate Recognition Using Optical Character Recognition and Connected Component Analysis. Smart Technologies in Data Science and Communication, Lecture Notes in Networks and Systems, Vol. 558, pp. 29–40. Springer, Singapore. https://doi.org/10.1007/978-981-19-6880-8_3


✍️ Random Dev Quote


Popular repositories Loading

  1. haswanth13901 haswanth13901 Public

  2. SLM-RAG-Search-Assistant SLM-RAG-Search-Assistant Public

    CPU-only RAG search assistant β€” real-time web answers + document Q&A using a local LLM, no GPU required.

    Python

  3. Django_Doctor-Finder Django_Doctor-Finder Public

    Full-stack doctor search platform with Django REST API, FHIR-compatible practitioner data, JWT auth, PostgreSQL backend, and Next.js/React frontend.

    Python

  4. RAG-chatbot-LangChain-observability RAG-chatbot-LangChain-observability Public

    Production-grade RAG chatbot using LangChain, Groq LLaMA, Gemini embeddings, and ChromaDB β€” with Phase 2 observability pipeline (latency tracking, token usage, CI/CD)

    Python