shikhararyan/README.md

👋 Hi, I'm Shikhar Aryan



🚀 About Me

AI/ML Engineer building production-grade systems that solve real-world problems

I specialize in taking complex technical challenges and turning them into deployed solutions that people actually use. From HIPAA-compliant healthcare AI to legal document automation, I've built systems across domains that require both technical depth and practical execution.

📍 Location: Ghaziabad, India
💼 Current: AI Engineer @ LetsAI Solutions
✍️ Writer: 12,000+ views on Medium covering production AI systems


💡 What I Actually Build

🏥 Healthcare AI with Regulatory Compliance

Built HIPAA-compliant medical AI system using Graph RAG architecture with automated PII anonymization. Integrated BioBERT for medical entity recognition and intelligent patient-doctor matching based on symptom classification.

Tech: Graph RAG, BioBERT, NER, HIPAA compliance, Multi-document processing
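As a rough illustration of what automated PII anonymization can look like, here is a minimal regex-based sketch. It is not the system described above: the pattern set, the `PII_PATTERNS` name, and the `anonymize` helper are all hypothetical, and a real HIPAA pipeline would cover many more identifier types and likely pair rules with a medical NER model.

```python
import re

# Hypothetical patterns for illustration only; real pipelines cover far more identifiers.
PII_PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "PHONE": re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


def anonymize(text: str) -> str:
    """Replace each matched identifier with a typed placeholder such as [EMAIL]."""
    for label, pattern in PII_PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text


note = "Contact jane@example.com or 555-123-4567, SSN 123-45-6789."
print(anonymize(note))
```

Typed placeholders keep the sentence structure intact, so downstream retrieval can still tell that a contact detail was present without seeing its value.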

⚖️ Legal Document Automation at Scale

Engineered a system that generates 150+ page immigration petitions in 45 minutes (vs. 50+ hours manually). Fine-tuned Llama 3.1 8B on 6,500+ curated examples using QLoRA, achieving 91% accuracy on gap detection, benchmarked against expert attorneys.

Tech: LangGraph, QLoRA fine-tuning, Gemini 2.5 (data curation), ChromaDB, Long-context generation

🎓 Multimodal Educational AI (10K+ Users)

First production implementation of Azure SQL vector search in India. Built multimodal RAG system integrating images and videos into AI tutoring sessions, achieving 99.7% uptime for 10,000+ students.

Tech: Azure SQL Vector (pioneer), Multimodal RAG, Smart media indexing, Production deployment
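The retrieval step behind a multimodal RAG system can be sketched in a few lines. The example below is an in-memory stand-in, assuming each image, video, or text chunk has already been embedded into a shared vector space. The toy `index`, its three-dimensional embeddings, and the `cosine` and `top_k` helpers are made up for illustration; a production setup would run the similarity search inside the database rather than in Python.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k(query, index, k=2):
    """Return the k items whose embeddings are most similar to the query."""
    scored = [(cosine(query, item["embedding"]), item) for item in index]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [item for _, item in scored[:k]]


# Toy index: media of different kinds share one embedding space (values are invented).
index = [
    {"id": "diagram.png", "kind": "image", "embedding": [0.9, 0.1, 0.0]},
    {"id": "lecture.mp4", "kind": "video", "embedding": [0.1, 0.9, 0.0]},
    {"id": "notes.txt", "kind": "text", "embedding": [0.7, 0.3, 0.1]},
]

for hit in top_k([1.0, 0.0, 0.0], index):
    print(hit["id"], hit["kind"])
```

Because every media type lives in the same space, a single query can surface an image and a text chunk side by side for the tutoring session.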


🛠️ Technical Expertise

expertise = {
    "advanced_architectures": ["Graph RAG", "Multimodal RAG", "Long-context generation (150+ pages)"],
    "fine_tuning": ["QLoRA", "Dataset curation (6,500+ examples)", "Model evaluation"],
    "production_systems": ["99.7% uptime", "10K+ users", "HIPAA compliance"],
    "specialized_domains": ["Healthcare (BioBERT)", "Legal AI", "Education"],
    "multi_agent": ["LangGraph", "CrewAI", "Complex orchestration"],
    "vector_dbs": ["Azure SQL Vector", "ChromaDB", "Pinecone", "MongoDB Atlas"],
    "llms": ["GPT-4o/o1", "Claude", "Llama 3.1", "Gemini 2.5"]
}

Core Stack

Python PyTorch LangChain Docker

Cloud & Databases

Azure AWS ChromaDB


📝 Recent Technical Writing

I write about production AI systems, not just tutorials. 10,000+ readers on Medium


🎯 Featured Projects

Fine-tuned model on 6,500+ denied petitions achieving 91% accuracy in identifying gaps that lead to RFEs. Automated data curation pipeline using Gemini 2.5.

Impact: 98% time reduction in petition review (50 hrs → 45 min)
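The headline figure can be checked with quick arithmetic. The snippet below is only a sanity check of the numbers quoted above; the `time_reduction_pct` helper is a hypothetical name.

```python
def time_reduction_pct(before_minutes: float, after_minutes: float) -> float:
    """Percentage of time saved when a task drops from `before` to `after`."""
    return (before_minutes - after_minutes) / before_minutes * 100


# 50 hours of manual review versus 45 minutes of automated generation.
reduction = time_reduction_pct(50 * 60, 45)
print(round(reduction, 1))
```

The result is roughly 98.5%, consistent with the 98% quoted.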

Graph RAG system with automated PII anonymization for healthcare deployment. Uses BioBERT for medical entity recognition.

Impact: HIPAA-compliant deployment for hospital chain

Azure SQL vector search implementation integrating images and videos into tutoring sessions. First production deployment in India.

Impact: 10,000+ students, 99.7% uptime
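To put 99.7% uptime in concrete terms, the sketch below converts an uptime percentage into a downtime budget. The `downtime_hours` helper and the chosen periods (a 30-day month, a 365-day year) are illustrative assumptions, not how the figure above was measured.

```python
def downtime_hours(uptime_pct: float, period_hours: float) -> float:
    """Hours of allowed downtime in a period for a given uptime percentage."""
    return (1 - uptime_pct / 100) * period_hours


monthly = downtime_hours(99.7, 30 * 24)   # 30-day month
yearly = downtime_hours(99.7, 365 * 24)   # 365-day year
print(round(monthly, 2), round(yearly, 2))
```

Under these assumptions the budget works out to a little over two hours per month, or about 26 hours per year.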

Multi-class sentiment classifier achieving 91% accuracy. Deployed on HuggingFace Hub.


📊 GitHub Activity

GitHub Stats

Top Languages


🎓 Background

B.Tech in Information Technology | JSS Academy of Technical Education (2019-2023)

Career Trajectory:

  • 2025-Present: AI Engineer @ LetsAI Solutions - Legal document automation, fine-tuning
  • 2024-2025: AI/ML Engineer @ Techoon Solutions - Healthcare AI, educational platforms
  • 2023: Research Analyst @ Quaintel Research - Market intelligence, data pipelines

💬 What I'm Known For

Building complete systems, not just prototypes
Production deployments with real users (10K+)
Advanced RAG architectures (Graph, Multimodal, Long-context)
Fine-tuning expertise with custom dataset curation
Working in regulated domains (HIPAA, legal)
Technical writing that explains complex systems clearly


🤝 Let's Connect

I'm always interested in discussing:

  • Production AI systems and architecture decisions
  • Fine-tuning strategies and dataset curation
  • RAG optimization (especially Graph RAG and multimodal)
  • Deploying AI in regulated domains (healthcare, legal)
  • MLOps and maintaining 99.7% uptime

LinkedIn Medium Email HuggingFace


⭐ If you find my work valuable, star the repositories you find interesting!


Building AI systems that work in production, not just in demos.

Pinned

  1. Recommendation-system-Collaboratve-Filtering- Public

    Collaborative Filtering based Recommendation System: This repository contains the implementation of a collaborative filtering-based recommendation system for book recommendations. It utilizes user-…

    Jupyter Notebook

  2. Text-Classification-Transformer-Model- Public

    This sentiment analysis model utilizes a Transformer architecture to classify text sentiment into positive, negative, or neutral categories with high accuracy. It preprocesses text data, trains the…

    Jupyter Notebook 2

  3. ChatBot-GPT-3.5-Turbo-Based- Public

    "Personal Chatbot Powered by GPT-3.5" is a Flask web application that utilizes OpenAI's powerful GPT-3.5 model to create a conversational chatbot experience. Users can interact with the chatbot in n…

    HTML 1

  4. Biometric-based-Voting-machine- Public

    This project is a Voting Machine application built with Python and MySQL. It verifies voters' Aadhaar card numbers, checks their eligibility to vote, and integrates with a finger recognition module …

    C++ 1

  5. Text-Generation-and-summarization-using-Pipelines- Public

    This project demonstrates sentiment analysis and text generation tasks using Hugging Face's NLP framework. It showcases the preprocessing of text data, sentiment analysis with pipelines, and text g…

    Jupyter Notebook 1 1

  6. WEB_Scrapping-basic-technique Public

    This assignment involves web scraping data from a Singapore government website using Python libraries like BeautifulSoup and requests. The scraped data includes names, citizenships, and dates of bi…

    Jupyter Notebook 1