About

7+ Papers Published
50+ Open-Source Projects
17+ Certifications
9.72 CGPA (B.Tech)

Independent AI/ML researcher and builder, specialising in building AI systems end-to-end: from implementing transformer architectures from scratch to deploying on-device deepfake detectors. Expertise spans LLM optimisation, computer vision, neural network compression, and synthetic data generation.

As a Research Intern at IIIT Hyderabad, I developed a Layout Preserving Document Translation system achieving 92% accuracy in layout detection with multi-language OCR support. I have published 7 research papers in international conferences and journals (IEEE, IJNRD, Zenodo) covering neural network compression, deepfake detection, and novel word representation systems.

Certified AWS Cloud Practitioner, AWS Solutions Architect, TensorFlow Developer, and WSO2 Micro Integrator Developer. Experienced across the full AI/ML stack — from building LLMs from scratch to deploying production systems on cloud infrastructure. I write an ML-from-scratch blog at blogs.gruheshkurra.com.

Education

B.Tech in Computer Science and Engineering
KL University, Hyderabad, India  ·  August 2021 – May 2025  ·  CGPA: 9.72 / 10

IELTS Academic  ·  Overall Band Score: 7.5  ·  C1 Advanced

Research Publications

Experience

Oct – Nov
2025
IA
AI Integration Engineer
InfoAjax Consulting Pvt Ltd  ·  Remote, Hyderabad
  • Led development of an AI-driven integration framework connecting enterprise applications with cybersecurity services via standardised APIs and intelligent orchestration.
  • Architected agent-based system using Azure OpenAI for automated schema mapping and adaptive connection strategies with performance learning from historical patterns.
  • Built full-stack application using React 18, Vite, TailwindCSS + FastAPI backend with WebSocket integration for real-time monitoring and agent coordination.
  • Integrated Azure Active Directory via Microsoft Graph API for application discovery and automated metadata management.
Dec 2024 –
Jun 2025
IH
Research Intern
IIIT Hyderabad  ·  Hyderabad, India
  • Led development of a Layout Preserving Document Translation system: Vision Transformer-based layout detection (92% accuracy), character metrics-based font detection, K-means colour palette extraction (88% accuracy).
  • Built FastAPI backend integrating Tesseract OCR for multi-language text extraction (English, Hindi, Telugu, German) while preserving document formatting.
  • Presented at IIIT Hyderabad research expo (May 2025) with live demonstrations to faculty and industry visitors.
Oct 2024 –
Jun 2025
IA
WSO2 API Developer
InfoAjax Consulting Pvt Ltd  ·  Remote, Hyderabad
  • Developed and deployed RESTful APIs for PLDT (Philippines' largest telecoms provider) using WSO2 Integration Studio for ticketing system integration.
  • Managed production ticket systems with secure credential management through GitLab and Choreo cloud platform with vault-based secrets management.
  • Engaged directly with international clients under strict SLAs to resolve production issues in environments with sparse framework documentation.
May – Oct
2024
ID
AI & ML Research Training
iHub-Data, IIIT Hyderabad  ·  Hyderabad
  • Completed intensive 6-month apprenticeship covering neural architecture design, LLM fundamentals, prompt engineering, model fine-tuning, quantisation, and deployment strategies.
  • Worked on research projects in document analysis, OCR systems, and computer vision under faculty mentorship.
Mar 2024 –
Jan 2025
Z
Founder & CEO
Zynthetix  ·  Hyderabad, India
  • Founded a synthetic data platform producing tabular, image, and text data using GANs and VAEs to address dataset scarcity in ML training.
  • Designed meta-controller architecture where a mother model orchestrates 200–1000 specialised child models to generate millions of rows without duplication.
  • Implemented prototype with transformer-based PII detection (98.5% accuracy) and 95% statistical similarity between synthetic and real data.
Jul – Oct
2023
XL
Data Science Intern
Xtraleap India Pvt Ltd  ·  Remote, Hyderabad
  • Exploratory data analysis, statistical modelling, and ML model development using Python (pandas, NumPy, scikit-learn, Matplotlib).
  • Feature engineering and model evaluation on real-world datasets in retail analytics and customer segmentation.

Selected Projects

DeepGuard project
DeepGuard — On-Device Deepfake DetectionPyTorch · Core ML · EfficientNet · FastAPI · SwiftUI

EfficientNet-B1 trained on 25k images; 98.62% accuracy in 55 min on M4 MacBook. Converted to 26MB Core ML model for real-time on-device iOS inference in <500ms. Full-stack FastAPI + React web app.

Transformers From Scratch
Transformers From ScratchPyTorch · Transformer · BPE · BLEU

"Attention Is All You Need" replicated end-to-end — full encoder-decoder with BLEU benchmarks. Separately implemented GPT-2 (124M params) from scratch with BPE tokeniser and full training loop.

BIE Bit-Index Encoding
BIE: Bit-Index EncodingPython · NumPy · Numba · Compression · Published

Novel neural weight compression achieving 40× ratio with MSE < 10⁻⁶. Published on Zenodo (2025) with complete open-source implementation including optimised sparse kernels and benchmarking suite.

Clinical Trial Similarity Analysis
Clinical Trial Similarity AnalysisPubMedBERT · FAISS · NLP · Python

PubMedBERT clinical NLP pipeline for trial matching. Achieved 99.5% accuracy on biomedical similarity tasks using domain-specific pre-trained embeddings and FAISS vector retrieval.

Zynthetix SWE Agent
Zynthetix SWE AgentPython · GitHub Actions · LLM · Agentic

Autonomous agent that reads GitHub issues, writes fix patches, and opens PRs — zero server needed. Runs entirely via GitHub Actions with Python orchestration and LLM-driven code editing.

GNN Suite
GNN SuitePyTorch Geometric · GCN · GraphSAGE · GAT

GCN, GraphSAGE, and GAT implementations with 81.9% accuracy on node classification benchmarks. Full comparison framework across architectures with detailed ablation studies.

GAN Implementation
GAN ImplementationPyTorch · GAN · DCGAN · MNIST

GAN trained on MNIST with multiple architectures (vanilla GAN, DCGAN). Full training pipeline with loss curves, generation quality metrics, and latent space interpolation.

Diffusion Model From Scratch
Diffusion Model From ScratchPyTorch · DDPM · UNet · Generative

DDPM implemented from math to generated images. ~130K parameter model trained end-to-end with full noise schedule, denoising UNet, and sample generation pipeline.

NL2SQL
NL2SQLPython · Transformers · SQL · HuggingFace

Natural language to SQL translation system achieving perplexity 1.42. Fine-tuned language model for structured query generation from plain-English questions.

Hybrid RAG Deepfake Detection
Hybrid RAG Deepfake DetectionRAG · CLIP · FAISS · PyTorch · Published

Privacy-first deepfake detection combining retrieval-augmented generation with visual inconsistency analysis. Uncertainty-aware classifier for enhanced interpretability. Published on Zenodo 2024.

ML From Scratch Blog Series
ML From Scratch Blog SeriesPython · NumPy · Pure ML · Educational

Implementations of Linear Regression, Logistic Regression, KNN, Naive Bayes, Decision Trees, Random Forest, SVM, K-Means, t-SNE, UMAP — all from math up with no sklearn wrappers.

Layout Preserving Document Translation
Layout Preserving Document Translation (IIITH)Vision Transformer · Tesseract · K-Means · FastAPI

Vision Transformer layout detection (92% accuracy) + multi-language OCR (English, Hindi, Telugu, German) preserving fonts, colours, and document structure. Research project at IIITH.

Technical Skills

Programming Languages

PythonJavaScriptTypeScriptSwiftJavaCC++RSQLPHPShell

AI / ML Frameworks

PyTorchTensorFlowHugging FaceLangChainLlamaIndexscikit-learnPyTorch GeometricGradioStreamlitUMAP

AI / ML Specialisations

LLM OptimisationSLM Fine-tuningRAGPrompt EngineeringTransformersAttention MechanismsGANsVAEsDiffusion ModelsGNNsOn-Device AIDeepfake DetectionQuantisationPruningDistillation

Computer Vision

EfficientNetVision TransformerMediaPipeOpenCVTesseractCLIPDocument AnalysisOCR

iOS / Mobile

SwiftSwiftUICore MLiOS DevelopmentVision FrameworkAVFoundationCore DataMetal MPSOn-Device Inference

Web & Backend

React.jsNext.jsFastAPINode.jsExpress.jsTailwindCSSViteReact NativeWebSocketREST APIs

Cloud & DevOps

AWSAzure OpenAIAzure ADGoogle CloudDockerVercelWSO2CI/CDGitServerless

Databases & Data Science

PostgreSQLMongoDBRedisFirebaseSupabasePandasNumPyMatplotlibFAISSSynthetic Data

Certifications

Awards & Honours

1st Place — University Webathon
KL University  ·  2022

Developed a comprehensive diet management platform in a 4-hour hackathon featuring personalised meal planning, calorie tracking, and nutritional recommendations. Competed against 50+ teams.

2nd Place — Design Expo
KL University  ·  2022–2023

Created an Arduino-based smart switchboard for IoT home automation with real-time energy monitoring and mobile app control. Recognised for innovative approach to energy efficiency and user-centric design.

SAC Momentum Award
Student Activity Council, KL University  ·  2023–2025

Awarded for technical leadership as Technical Lead. Mentored 20+ juniors in web development and ML, organised workshops for 100+ students, and coordinated 15+ technical events.

Contact

Email: gruheshkurra2@gmail.com

Phone: +91 8008258065

Location: Hyderabad, India

Twitter/X  ·  GitHub  ·  LinkedIn  ·  Hugging Face  ·  Blog  ·  Email

Open to collaborating on: Transformer Research  ·  LLM Development  ·  iOS On-Device AI  ·  Vision-Language Models  ·  Neural Compression  ·  Agentic Systems