Hi, I'm Omkar Nayak.

A Scientist
Self-driven, quick starter, passionate programmer with a curious mind who enjoys solving complex and challenging real-world problems.

Mathematics and Computer Science Student at the University of Michigan - Ann Arbor. Machine Learning Engineer at Synafox AI and Research Associate II at Zhou Lab.

Currently developing autonomous agentic workflows for clean energy infrastructure and working on publications in LLMs and GWAS.

Projects

CrewAI Workflow
Autonomous Energy Infrastructure Agent

Multi-agent system using CrewAI for monitoring and optimizing clean energy infrastructure through digital twin simulations.

Project Details
  • Tools: CrewAI, LangChain, Python, Docker
  • Autonomous monitoring of digital twin simulations
  • Predictive model optimization for resource allocation
  • MLOps integration with CI/CD pipeline
RAG Pipeline
Production RAG Pipeline

Scalable RAG pipeline using LangChain and vector databases for domain-specific LLM applications.

Project Details
  • Tools: LangChain, Vector Databases, Python, FastAPI
  • Low-latency context-aware inference
  • Enterprise-grade scalability
  • Domain-specific knowledge integration
GWAS LLM
LLM-Enhanced GWAS Analysis

Novel phenotype recognition method using LLMs for Genome Wide Association Studies with 50,000+ genetic records.

Project Details
  • Tools: Python, PyTorch, LangChain, PubMed API
  • Tested on 12 distinct phenotypes
  • 50,000+ genetic sequencing records analyzed
  • Electronic Health Records integration
Medical AI
Expert Reasoning in Medical LLMs

Integrating expert medical reasoning into LLMs, improving diagnostic accuracy by 38% on 80,000+ health records.

Project Details
  • Tools: Python, vLLM, NeMo Guardrails, PyTorch
  • 38% improvement in diagnostic accuracy
  • 80,000+ precision health records analyzed
  • HIPAA-compliant implementation
MTF-projects
Monitoring the Future

Analysis of the Monitoring the Future dataset using data analytics tools to discover trends in attributes of adolescents in the US and predicting their political learning.

Project Details
  • Tools: Pandas, scikit-learn, Python
  • Data analysis and visualization
  • Predictive modeling
  • Model performance monitoring
News Aggregation
News Aggregation Platform

Intelligent news aggregator using NLP and machine learning to classify, summarize, and recommend trending articles from multiple sources.

Project Details
  • Tools: Python, scikit-learn, Flask, BeautifulSoup, NLP
  • Automated web scraping and article classification
  • Real-time topic detection and summarization
  • Personalized news recommendations

Experience

2025
Machine Learning and Systems Engineer
  • Designed and developed autonomous agentic workflows using CrewAI for the continuous monitoring and optimization of digital twin simulations improving the management of predictive models for system analysis and resource allocation.
  • Integrated of multiple Agentic Frameworks into the CI/CD pipeline, automating the software development cycle by using MLOps principles to minimize manual intervention and accelerate the Agile iteration frequency.
  • Deployed scalable, production-grade RAG pipelines using LangChain and vector databases to serve domain-specific LLMs. This architecture ensured low-latency, context-aware inference for specialized enterprise applications.
  • Tools: Python, CrewAI, LangChain
May 2025 - Present | Detroit, Michigan
Research Assistant II
  • Enhanced Genome Wide Association Studies (GWAS) at the UM School of Public Health by authoring a paper on a novel phenotype recognition method, the bespoke Large Language Models (LLMs) successfully tested on 12 distinct phenotypes from over +50,000 Genetic Sequencing and Electronic Health Records.
  • Advanced Deep Learning diagnostic accuracy by about 38% by collaborating on a paper within a cross-functional team of doctors at Michigan Medicine to integrate expert reasoning into LLMs. This work analyzed +80,000 precision health records from +24,000 patients, using NeMo Guardrails to ensure data security and topic relevance.
  • Accelerated AI-driven research cycles by 15x on a HIPAA-regulated HPC cluster by implementing a parallelized software architecture with vLLM within an Agile development framework while utilizing Site Reliability Engineering techniques to employ information security methods to protect sensitive data.
  • Tools: Python, LangChain, LangGraph, PubMed Central API, PyTorch, vLLM, Bash
August 2024 - Present | Ann Arbor, Michigan
2024
Machine Learning Engineering Intern
  • Deployed Predictive Modeling using Lightwood and Hugging Face models on Docker to develop 5+ proprietary ML Models.
  • Analyzed information using the internal database and integrated ML models into the SAAS platform services of ”Suuchi GRID”, resulting in a 27% increase in orders.
  • Tools: Python, MindsDB, Hugging Face, Lightwood, PyTorch, Apache Superset, Docker, PostgreSQL
May 2024 - November 2024 | Jersey City, New Jersey

Education

University of Michigan - Ann Arbor

Ann Arbor, Michigan, USA

Degree: Bachelor of Science in Computer Science and Mathematics

    Relevant Courseworks:

    • Graduate Machine Learning
    • Graduate ML and SP in Biomedical Sciences
    • Graduate Probability Theory
    • Machine Learning Research
    • Computer Vision
    • Advanced Data Science
    • Data Structures and Algorithms

Skills

Frameworks

Libraries

Developer Tools

Contact