Skip to content
View w00jay's full-sized avatar

Block or report w00jay

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
w00jay/README.md

Hi, I'm Woojay πŸ‘‹

Director of Platform Engineering at ZeroEyes, building AI infrastructure on FedRAMP-compliant Kubernetes.

5+ years leading platform teams, 10+ years hands-on across systems and cloud-native infra. Currently extending that discipline to the AI platform layer β€” MCP servers, RAG systems, GPU inference, LLM evaluation, and the security models regulated industries need for agentic workloads.

What I'm working on

  • 🧠 MCP servers and RAG systems in production (see arxiver ↓)
  • 🎯 Dimension-routed LLM evaluation on local GPUs (see nite-eval ↓)
  • πŸ”Š GPU-backed inference serving patterns (see transcriber ↓)
  • πŸ” FedRAMP Moderate platform buildout at work β€” AWS, EKS, Istio/Envoy, FIPS-140

Pinned projects

  • arxiver β€” Personal arXiv research assistant. FastMCP server, ChromaDB semantic search, TensorFlow recommendations, LLM summaries. Same backend exposed as CLI, REST API, Streamlit UI, and MCP.
  • nite-eval β€” Autonomous overnight LLM evaluation pipeline. 15 multi-turn agentic tasks, dimension-routed dual-judge scoring, SQLite checkpoint/resume. Targets and judges on separate GPUs.
  • transcriber β€” Streaming STT with NVIDIA Canary-1B on CUDA. FastAPI inference endpoint, chunked audio streaming, proof-of-pattern for GPU model serving.
  • artemisee β€” Real-time 3D visualization of NASA's Artemis II using JPL Horizons ephemeris data. Hermite spline interpolation, client-side celestial math, 10 live data feeds.

Elsewhere

Stack I reach for

Python Go Kubernetes AWS GCP Pulumi OpenTelemetry llama.cpp MCP FastAPI Supabase Chroma Claude Code

Before this

AWS (Systems Development Engineer) Β· Apex Clearing (SRE) Β· LINBIT (software engineer, DRBD) Β· co-founded an embedded hardware startup Β· US Air Force officer

Pinned Loading

  1. arxiver arxiver Public

    Personal arXiv research assistant: semantic search, ML-based recommendations, LLM summaries, and an MCP server so Claude (or any agent) can query your reading history. FastAPI + ChromaDB + TensorFlow.

    Jupyter Notebook

  2. artemisee artemisee Public

    Real-time 3D visualization of NASA's Artemis II mission using JPL Horizons data

    TypeScript

  3. nite-eval nite-eval Public

    Autonomous overnight LLM eval pipeline for local GGUF models β€” multi-turn agentic tasks, dimension-routed dual-judge scoring, SQLite-backed comparison reports. Built for llama.cpp + llama-swap on d…

    Python

  4. transcriber transcriber Public

    Streaming speech-to-text transcriber with nvidia/canary-1b

    Python 5