~/portfolio

Mustakim Shikalgar

Software Engineer · MS @ ASU · IEEE Researcher

const focus = "distributed systems";
// Building systems that scale
~/about

MS Software Engineering student at Arizona State University (GPA 3.75), graduating May 2026. Published IEEE researcher, Technical Architecture Lead, and top 15% LeetCode solver. I work at the intersection of distributed systems, machine learning, and full-stack engineering.

// education
Arizona State University
MS Software Engineering
GPA: 3.75 · Graduating May 2026
// current_role
Technical Architecture Lead
METY Legal
Leading 6-person cross-functional team
~/projects
const projects = await fetchProjects("all");
01
AI / Full Stack · NDA

METY Legal Chatbot

LangGraph · Django · React · FastAPI

$0.0008 · Cost per query

AI-powered personalized legal assistant (industry capstone, contract) with three distinct user modes, built on a distributed Django, React, FastAPI, LangGraph, and MongoDB stack. Served as Technical Architecture Lead for a 6-person cross-functional team across 7 sprints. Designed a dynamic FSPR knowledge profiling system that builds implicit user knowledge profiles through async background inference and targets every response at each user's Most Critical Knowledge Gap. Architected a privacy-first two-layer system that separates data orchestration from a stateless AI service, so each layer can scale horizontally on its own. The deployment uses an Nginx reverse proxy for HTTPS termination and load balancing, multi-worker Gunicorn for Django, Uvicorn-served FastAPI replicas behind a load balancer, and Celery with Redis for fault-tolerant background task queuing in place of Python threads. Rebuilt the LangGraph reasoning pipeline, cutting cost by 85% to $0.0008 per query. Engineered token-aware rolling summarization for coherent context retention over long sessions, and implemented lawyer-style intake behavior, context-aware legal document generation, parallel multi-dimension self-assessment evaluation, and a progressive course system spanning 9 legal domains.
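To illustrate the token-aware rolling summarization idea, here is a minimal sketch. It is not the project's code (that is under NDA). The word-count tokenizer, the `RollingContext` class, and the `toy_summarizer` function are all assumptions made for illustration; a real system would use the model's tokenizer and an LLM call to summarize.

```python
from dataclasses import dataclass, field
from typing import Callable, List


def count_tokens(text: str) -> int:
    """Crude stand-in for a real tokenizer: one token per whitespace-separated word."""
    return len(text.split())


@dataclass
class RollingContext:
    """Hypothetical container holding a running summary plus recent turns."""
    max_tokens: int                              # budget for summary + verbatim turns
    summarize: Callable[[str, List[str]], str]   # (old_summary, evicted_turns) -> new summary
    summary: str = ""
    turns: List[str] = field(default_factory=list)

    def total_tokens(self) -> int:
        return count_tokens(self.summary) + sum(count_tokens(t) for t in self.turns)

    def add_turn(self, turn: str) -> None:
        self.turns.append(turn)
        # Fold the oldest turns into the summary until the budget is met,
        # always keeping at least the newest turn verbatim.
        while self.total_tokens() > self.max_tokens and len(self.turns) > 1:
            evicted = self.turns.pop(0)
            self.summary = self.summarize(self.summary, [evicted])

    def prompt_context(self) -> str:
        header = [f"Summary so far: {self.summary}"] if self.summary else []
        return "\n".join(header + self.turns)


def toy_summarizer(old_summary: str, evicted: List[str]) -> str:
    """Placeholder for an LLM call: keeps the first three words of each evicted turn."""
    clipped = [" ".join(t.split()[:3]) for t in evicted]
    return "; ".join(([old_summary] if old_summary else []) + clipped)
```

Calling `add_turn` after every message keeps the prompt under `max_tokens` while the newest turns stay verbatim, which is the property that makes long sessions stay coherent.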

$0.0008
Cost per query
85%
Cost reduction
9
Legal domains
3
User modes
LangGraph · Django · React · FastAPI · MongoDB · GPT-4o · GPT-4o-mini · Python
Private · NDA
02
Distributed Systems

AegisFlow

FastAPI · NATS JetStream · pgvector · OpenTelemetry

7 · Microservices

Production-grade AI reliability platform that treats every agent invocation as a probabilistic operation requiring scoring, validation, recovery, and tracing. Seven decoupled microservices (gateway, orchestrator, reliability engine, guardrail layer, semantic memory, replay engine, and chaos injector) communicate over a NATS JetStream event bus with shared Postgres, pgvector, and Redis state. The reliability engine combines per-provider circuit breakers, confidence scoring, hallucination heuristics, and adaptive fallback routing to recover non-deterministically failing agents without human intervention. Every execution is event-sourced for deterministic replay: given a trace ID and a frozen model snapshot, the system reproduces any historical run bit-for-bit. The platform ships a full OpenTelemetry, Prometheus, and Grafana observability stack, plus a chaos engine that injects latency, malformed outputs, and provider outages on demand.
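A per-provider circuit breaker with confidence-gated fallback can be sketched roughly as follows. This is an illustrative sketch, not AegisFlow's actual code: the `CircuitBreaker` class, the `route` function, the thresholds other than the 0.75 minimum confidence, and the choice to count a low-confidence answer as a failure are all assumptions.

```python
import time
from typing import Callable, Dict, Optional, Sequence, Tuple

MIN_CONFIDENCE = 0.75  # the minimum-confidence figure quoted for the project

Provider = Callable[[str], Tuple[str, float]]  # hypothetical: prompt -> (answer, confidence)


class CircuitBreaker:
    """Opens after `failure_threshold` consecutive failures; allows trial calls after `reset_after` seconds."""

    def __init__(self, failure_threshold: int = 3, reset_after: float = 30.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.failure_threshold = failure_threshold
        self.reset_after = reset_after
        self.clock = clock
        self.failures = 0
        self.opened_at: Optional[float] = None

    def allow(self) -> bool:
        if self.opened_at is None:
            return True  # closed: traffic flows normally
        # Open: block calls until the cool-down has elapsed, then permit trial calls.
        return self.clock() - self.opened_at >= self.reset_after

    def record(self, success: bool) -> None:
        if success:
            self.failures = 0
            self.opened_at = None
        else:
            self.failures += 1
            if self.failures >= self.failure_threshold:
                self.opened_at = self.clock()


def route(prompt: str,
          providers: Sequence[Tuple[str, Provider]],
          breakers: Dict[str, CircuitBreaker],
          min_confidence: float = MIN_CONFIDENCE) -> Optional[Tuple[str, str]]:
    """Try providers in order, skipping open breakers; return the first confident answer."""
    for name, call in providers:
        breaker = breakers[name]
        if not breaker.allow():
            continue
        try:
            answer, confidence = call(prompt)
        except Exception:
            breaker.record(False)
            continue
        ok = confidence >= min_confidence  # assumption: low confidence counts as a failure
        breaker.record(ok)
        if ok:
            return name, answer
    return None  # every provider was unavailable or not confident enough
```

Injecting the clock keeps the breaker testable without sleeping, which also suits replay and chaos testing.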

7
Microservices
0.75
Min confidence
Retry depth
Python · FastAPI · NATS · pgvector · OpenTelemetry · Docker · Kubernetes · Redis · PostgreSQL
03
Distributed Systems

Distributed Key-Value Store

Raft Consensus · gRPC · Java · Docker

5 · Cluster nodes

Fault-tolerant distributed KV store implementing Raft consensus for leader election and log replication across a 5-node cluster. Consistency is tunable: configurable quorum-based reads let the cluster trade off CP against AP behavior.
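The general quorum rule behind tunable reads can be sketched as below. This shows the standard overlap condition (read quorum + write quorum > cluster size), not the store's Java implementation; the function names and the `(log_index, value)` reply shape are assumptions for illustration.

```python
from typing import List, Optional, Tuple


def majority(n: int) -> int:
    """Smallest number of nodes that forms a majority of an n-node cluster."""
    return n // 2 + 1


def quorums_overlap(n: int, read_quorum: int, write_quorum: int) -> bool:
    """True when every read quorum shares at least one node with every write quorum."""
    return read_quorum + write_quorum > n


def quorum_read(replies: List[Tuple[int, str]], read_quorum: int) -> Optional[str]:
    """Return the value with the highest log index, or None if too few replicas answered.

    `replies` holds hypothetical (log_index, value) pairs from the replicas that responded.
    """
    if len(replies) < read_quorum:
        return None  # not enough replicas: refuse rather than risk a stale answer
    _, value = max(replies, key=lambda reply: reply[0])
    return value
```

With 5 nodes and majority writes (3), a read quorum of 3 overlaps every write and behaves in the CP direction, while a read quorum of 1 answers faster but may return stale data, which is the AP direction.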

5
Cluster nodes
<10ms
Read latency
Strong
Consistency
Java · Raft · gRPC · Docker · Distributed Systems
04
Machine Learning

Semiconductor Yield Predictor

Random Forest · L1 Selection · Streamlit

76% · Recall

Binary classification system for predicting semiconductor wafer pass/fail on a severely imbalanced dataset. Two-stage pipeline using L1 regularization for dimensionality reduction followed by Random Forest with balanced class weighting.
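To show why balanced class weighting and recall matter on an imbalanced dataset, here is a small standard-library sketch. The weight formula is the "balanced" heuristic that scikit-learn documents (`n_samples / (n_classes * count)`); the 93/7 label split is made-up toy data, not the project's dataset, and the helper names are assumptions.

```python
from collections import Counter
from typing import Dict, Sequence


def balanced_class_weights(labels: Sequence[int]) -> Dict[int, float]:
    """Weight each class by n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {cls: n_samples / (n_classes * cnt) for cls, cnt in counts.items()}


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def recall(y_true: Sequence[int], y_pred: Sequence[int], positive: int = 1) -> float:
    """Fraction of true positives that the classifier actually caught."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    return tp / (tp + fn) if (tp + fn) else 0.0


# Toy data: 93 passing wafers (0) and 7 failing wafers (1).
y_true = [0] * 93 + [1] * 7
always_pass = [0] * 100  # a useless model that never predicts a failure

weights = balanced_class_weights(y_true)
baseline_accuracy = accuracy(y_true, always_pass)
baseline_recall = recall(y_true, always_pass)
```

On this toy split the always-pass model scores 93% accuracy with zero recall, which is why recall is the headline metric here and why the rare class gets the larger weight.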

76%
Recall
0.81
ROC-AUC
590→113
Features
Python · scikit-learn · Streamlit · SECOM · ML
05
Computer Vision

PixelDrive: Road Scene Segmentation

U-Net · SegNet · DeepLabV3+ · TensorFlow

95.50% · mIoU

Semantic segmentation of road scenes for autonomous driving. Trained and compared three architectures (U-Net, SegNet, and DeepLabV3+) on the Lyft/Udacity CARLA simulator dataset (13 semantic classes). Fixed 7 critical bugs in the original codebase, including a broken IoU metric, faulty mask decoding, and a softmax/logits mismatch.
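For reference, mean IoU over class-ID masks can be computed as in the sketch below. It is a plain-Python illustration of the metric, not the project's TensorFlow code; it assumes predictions have already been converted from logits to class IDs, and that classes absent from both masks are skipped rather than counted as zero.

```python
from typing import List, Optional, Sequence


def per_class_iou(y_true: Sequence[int], y_pred: Sequence[int],
                  num_classes: int) -> List[Optional[float]]:
    """IoU per class over flattened masks; None for classes absent from both masks."""
    intersection = [0] * num_classes
    union = [0] * num_classes
    for t, p in zip(y_true, y_pred):
        if t == p:
            intersection[t] += 1
            union[t] += 1
        else:
            union[t] += 1  # pixel belongs to the true class...
            union[p] += 1  # ...and was also claimed by the predicted class
    return [intersection[c] / union[c] if union[c] else None
            for c in range(num_classes)]


def mean_iou(y_true: Sequence[int], y_pred: Sequence[int], num_classes: int) -> float:
    """Average IoU over the classes that appear in either mask."""
    ious = [v for v in per_class_iou(y_true, y_pred, num_classes) if v is not None]
    return sum(ious) / len(ious) if ious else 0.0
```

Skipping absent classes is a design choice: averaging them in as zeros would penalize images that simply do not contain every class.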

95.50%
mIoU
45 FPS
Inference
13
Classes
Python · TensorFlow · Keras · U-Net · DeepLabV3+ · CV
06
Research / Full Stack

Missing Persons Knowledge Graph

RDF · SPARQL · FastAPI · React

10K+ · Records

Knowledge graph integrating 10,000+ NamUs records using semantic web technologies. Replaced a $50/month GraphDB/Azure backend with FastAPI + RDFLib achieving sub-100ms SPARQL queries at zero infrastructure cost.

10K+
Records
<100ms
Query latency
$0
Infra cost
RDF · SPARQL · FastAPI · React · Ontology · Python
07
Systems / Algorithms

De Bruijn Genome Assembler

Java · Spring Boot · Graph Algorithms

99.9% · Coverage

Genome assembler achieving 99.9% coverage on phiX174 (5,386 bp) using de Bruijn graphs and Eulerian cycle traversal. Implemented error correction, including tip removal and bubble detection, along with multi-format support.
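The core de Bruijn idea can be sketched in a few lines. This is a toy illustration in Python, not the project's Java code: it walks an Eulerian path on a short linear sequence with Hierholzer's algorithm (the circular phiX174 case would close the walk into a cycle), treats each distinct k-mer as a single edge, and does no error correction. The function names and the sample reads are assumptions.

```python
from collections import defaultdict
from typing import Dict, List


def build_de_bruijn(reads: List[str], k: int) -> Dict[str, List[str]]:
    """Nodes are (k-1)-mers; each distinct k-mer adds one edge prefix -> suffix."""
    graph: Dict[str, List[str]] = defaultdict(list)
    seen = set()
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            if kmer not in seen:
                seen.add(kmer)
                graph[kmer[:-1]].append(kmer[1:])
    return graph


def eulerian_path(graph: Dict[str, List[str]]) -> List[str]:
    """Hierholzer's algorithm: visit every edge exactly once."""
    in_degree: Dict[str, int] = defaultdict(int)
    for targets in graph.values():
        for node in targets:
            in_degree[node] += 1
    # Start where out-degree exceeds in-degree by one, else at any node (cycle case).
    start = next((u for u in graph if len(graph[u]) - in_degree[u] == 1),
                 next(iter(graph)))
    edges = {u: list(vs) for u, vs in graph.items()}  # working copy we can consume
    stack, path = [start], []
    while stack:
        node = stack[-1]
        if edges.get(node):
            stack.append(edges[node].pop())
        else:
            path.append(stack.pop())
    return path[::-1]


def assemble(reads: List[str], k: int) -> str:
    """Spell the sequence: first node, then the last character of each later node."""
    path = eulerian_path(build_de_bruijn(reads, k))
    return path[0] + "".join(node[-1] for node in path[1:])
```

For example, `assemble(["ATGCGT", "GCGTAC", "GTACCA"], 4)` rebuilds `ATGCGTACCA` from three overlapping reads. Repeated (k-1)-mers create branching paths, which is where tip removal and bubble detection come in.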

99.9%
Coverage
5,386 bp
Genome
phiX174
Dataset
Java · Spring Boot · React · Graph Algorithms · Bioinformatics
08
Web / Games

Zebradoodle

React · Wordle / Quordle / Sedecordle / Nerdle

4 · Game modes

A browser word and math puzzle collection rebuilt from a 2022 Java CLI prototype I wrote in undergrad. Four game modes share one engine: Wordle (1 board), Quordle (4 boards), Sedecordle (16 boards, 21 guesses), and Nerdle (8-character math equation). Each day, everyone gets the same puzzle, chosen by a deterministic FNV-32 hash of the date; practice mode rolls a fresh puzzle on demand. Scoring is a JS port of the original Java compare() method: a two-pass algorithm that handles repeated letters correctly. The Nerdle module ships its own tokenizer and evaluator, which enforce operator precedence, exact integer division, and no-leading-zero rules. Stats, streaks, and daily-resume state live in localStorage, so the whole thing deploys as a static bundle.
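Both the two-pass scoring and the daily puzzle pick can be sketched briefly. This is a Python illustration, not the game's JavaScript: the function names are hypothetical, the FNV-1a variant is an assumption (the text above only says FNV-32), and hashing the ISO date string is one plausible way to key the hash.

```python
from collections import Counter
from datetime import date
from typing import List


def score_guess(guess: str, answer: str) -> List[str]:
    """Two-pass Wordle scoring that handles repeated letters."""
    result = ["absent"] * len(guess)
    remaining: Counter = Counter()
    # Pass 1: mark exact matches and count the answer letters left unmatched.
    for i, (g, a) in enumerate(zip(guess, answer)):
        if g == a:
            result[i] = "correct"
        else:
            remaining[a] += 1
    # Pass 2: a misplaced letter is "present" only while unmatched copies remain.
    for i, g in enumerate(guess):
        if result[i] != "correct" and remaining[g] > 0:
            result[i] = "present"
            remaining[g] -= 1
    return result


def fnv1a_32(data: bytes) -> int:
    """32-bit FNV-1a: XOR each byte in, then multiply by the FNV prime."""
    h = 0x811C9DC5  # FNV offset basis
    for byte in data:
        h ^= byte
        h = (h * 0x01000193) & 0xFFFFFFFF  # FNV prime, truncated to 32 bits
    return h


def daily_index(day: date, bank_size: int) -> int:
    """Map a calendar date to a puzzle index that is the same for every player."""
    return fnv1a_32(day.isoformat().encode("utf-8")) % bank_size
```

Guessing `speed` against `abide` marks only the first `e` as present, because the answer contains a single `e`. Since the index depends only on the date and the bank size, no server is needed to keep players in sync.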

4
Game modes
14.8K
Word bank
2022
Original
React 17 · JavaScript · React Router · CSS · localStorage · canvas-confetti
// 8 projects
~/skills
export const techStack = {
// 01 · Languages
Java · Python · TypeScript · C++ · SQL
// 02 · Backend & Systems
Spring Boot · Django · FastAPI · gRPC · Raft Consensus
// 03 · AI / ML
LangChain / LangGraph · scikit-learn · PyTorch · RAG Pipelines · Knowledge Graphs
// 04 · Frontend
React · Next.js · Tailwind CSS · Streamlit
// 05 · Data & Infra
MongoDB · PostgreSQL · Docker · Redis · Kafka
};
// Always learning, always building
~/contact

Let's Connect

I'm always open to discussing distributed systems, AI infrastructure, or new opportunities.