Hi, I'm
Varshith.

B.Tech Data Science student and full-stack builder. Founder of OQENS and CTO at Unifesto, obsessed with shipping production-grade platforms and AI tools.

Fuelled by open source, cloud native architectures, and raw data. Expertise in React.js, Node.js, PostgreSQL, GCP, and OCI.

Varshith Chowdary
AI INFRASTRUCTURE
FULL STACK DEVELOPER
CLOUD ARCHITECT
SYSTEMS THINKER
PROBLEM SOLVER
AI INFRASTRUCTURE
FULL STACK DEVELOPER
CLOUD ARCHITECT
SYSTEMS THINKER
PROBLEM SOLVER
AI INFRASTRUCTURE
FULL STACK DEVELOPER
CLOUD ARCHITECT
SYSTEMS THINKER
PROBLEM SOLVER

About

B.Tech Data Science student building full-stack web applications and AI-powered tools.

Founder of OQENS — an independent search engine ecosystem. Currently CTO at Unifesto, leading technology strategy and cloud infrastructure across GCP and OCI.

Self-driven builder who leverages React, Node.js, and PostgreSQL to ship real, production-grade products from scratch.

Experience

2026—

Chief Technology Officer

Unifesto
2026

Founder & Lead Developer

ORENAI
2025

Founder

OQENS Agency

Core Technologies

React
React
Next.js
Next.js
Tailwind CSS
Tailwind CSS
Python
Python
FastAPI
FastAPI
PostgreSQL
PostgreSQL
Docker
Docker
Supabase
Supabase
React
React
Next.js
Next.js
Tailwind CSS
Tailwind CSS
Python
Python
FastAPI
FastAPI
PostgreSQL
PostgreSQL
Docker
Docker
Supabase
Supabase
React
React
Next.js
Next.js
Tailwind CSS
Tailwind CSS
Python
Python
FastAPI
FastAPI
PostgreSQL
PostgreSQL
Docker
Docker
Supabase
Supabase

AI Infrastructure

Engineered scalable AI infrastructure using highly-optimized, self-hosted LLM inference systems to deliver sub-second reasoning responses.

Deployed Agent Network

Qwen 2.5 3B
Qwen 2.5 3BPrimary Reasoning Agent

Handles complex multi-turn logic, orchestrates tool calling, and generates final user responses. Quantized to GGUF Q5_K_M for optimal VRAM usage.

Phi-4
Phi-4Code & Math Generator

Specialized agent reserved for high-accuracy algorithmic tasks and structured JSON outputs. Runs on Q4_K_M for rapid inference.

Gemma 2B
Gemma 2BFast Routing & Filtering

Lightweight model used exclusively for rapid prompt classification, safety filtering, and RAG query generation before hitting heavier models.

Backend Inference Engine

llama.cpp
llama.cpp

Highly optimized C++ inference backend maximizing VRAM efficiency on edge hardware.

Ollama
Ollama

Streamlined local runner for quantized GGUF models, achieving sub-second Time-To-First-Token.

FastAPI Orchestration

High-throughput asynchronous Python endpoints handling concurrent LLM streams, request queuing, and dynamic load balancing across multiple inference nodes. Built to scale proprietary RAG pipelines.

Cloud & DevOps

Highly available
deployments.

AWS Operations
AWS Operations
AWS Operations

Architecting serverless ecosystems, managing high-availability EC2 clusters, and provisioning scalable RDS databases tailored for production workloads.

Google Cloud
Google Cloud
Google Cloud

Deploying intelligent compute engines, orchestrating containerized workloads via GKE, and leveraging native GCP machine learning infrastructure.

Microsoft Azure
Microsoft Azure
Microsoft Azure

Integrating enterprise-grade App Services, configuring secure active directories, and managing resilient cloud storage and CDN solutions.

Oracle OCI
Oracle OCI
Oracle OCI

Setting up high-performance bare-metal servers, automated webhook deployment pipelines, and robust virtual cloud networks for extreme scalability.

Linux & DevOps
Linux & DevOps
Linux & DevOps

Writing infrastructure as code, constructing multi-stage CI/CD GitHub Actions pipelines, and configuring hardcore Systemd & Nginx reverse proxies.

Featured Work

AI Conversational Platform

ORENAI

Engineered scalable AI infrastructure using Groq and Llama-3 to deliver sub-second reasoning responses. Orchestrated custom RAG pipelines and intelligent tool execution.

Building • Coming Soon
University Event Marketplace

Unifesto

Architected a high-throughput multi-tenant platform for university fests. Designed fault-tolerant PostgreSQL schemas with Prisma and ultra-fast Next.js caching layers.

Creative Agency Hub

OQENS

Built a visually stunning, performant agency portfolio prioritizing extreme Lighthouse scores. Integrated advanced Framer Motion layout animations and React-Three-Fiber models.

Let's Build
Something
Massive.

I am currently open for new opportunities in AI Infrastructure, Backend Engineering, and scalable Cloud-native development.

© 2026 Varshith Chowdary.
All rights reserved.