Available · Remote worldwide Selected Work · 2020 — 2026
Portfolio

Yevhenii
Budanov

Senior Software Engineer & AI Solutions Architect — building production LLM, RAG and full-stack systems for FinTech, HealthTech, LegalTech, EdTech and scientific research.

15+years engineering production software
7+years applied AI & LLM systems
9selected projects in this dossier
10distributed engineers led across 5+ countries

A practitioner, not a watcher.

I build production AI systems that ship — LLM-powered apps, RAG pipelines, agentic workflows, and the boring infrastructure underneath that keeps them running.

Over the last seven years I've focused on applied AI in healthcare, legal, research, FinTech, and education — partnering with companies like JoVE and Ozmosys, plus other B2B clients across multiple industries. Earlier years were full-stack engineering on PHP/JS billing platforms, CRMs and IPTV services with thousands of subscribers.

I write production code, design APIs and data schemas, deploy to AWS and GCP, make architectural calls — then I lead small distributed teams (up to 10 engineers across multiple countries) through delivery. I integrate AI-assisted development (Claude, Cursor, GPT-based assistants) into the workflow with appropriate review discipline.

My information-security background (university degree) keeps shipping speed honest about safety: HIPAA-compliant deployments, secure auth, RBAC, secrets management.

Two clients I'm allowed to name.

JoVE

Backend architecture · 2021–2022 · Medical Chatbot 2022–2023

The world's leading peer-reviewed scientific video journal and education platform. I designed the initial backend architecture (Symfony + PostgreSQL) and built REST APIs powering a Next.js frontend with SSR/SSG. Later, I led delivery of a HIPAA-compliant medical knowledge chatbot in partnership with JoVE.

1,800+
Universities & companies
30,000+
Peer-reviewed videos

Ozmosys

Engineering partner · 2020 → 2024

Generative AI for legal and financial professionals. Across multiple engagements I owned backend re-architecture, frontend rebuild, NLP integration for entity extraction, AI-powered search, and a REST API for CRM integrations. Later projects extended into LLM-based legal document search and an autonomous AI research assistant with citation-grounded reports.

AmLaw 100
Firms trust the platform
1M+
Partner-level memos trained on

Nine projects, in reverse chronology.

Most engagements through Data Pro Software Solutions (Boston, MA). Client names confidential except where noted as partnerships.
012025–2026
FinTech Cloud-native Full-stack TypeScript

Cloud-Native Trading Infrastructure & Microservices Platform

Production-grade serverless platform for scalable financial applications combining microservices, infrastructure-as-code, and real-time data processing.

What I built

  • TypeScript/Node.js microservices on AWS Lambda + API Gateway with clear service boundaries and independent scaling
  • Fully reproducible Terraform infrastructure (RDS, ElastiCache, S3, CloudFront, IAM)
  • Custom DB migration runner on ECS Fargate with checksum validation, advisory locking, DynamoDB audit tracking
  • Artifact-based deployment with versioned builds in S3, decoupled from cross-repo coupling
  • Private VPC architecture with NAT, SSM-based access, and service-to-database isolation
  • Redis caching and rate-limiting with explicit key schemas and TTL policies

Architecture

SPA Client CloudFront + S3 API Gateway Cognito auth λ Trading Service λ Account Service λ Pricing Service RDS PostgreSQL ElastiCache Redis Migrations ECS Fargate Terraform IaC · Private VPC · NAT · SSM access · IAM
Serverless microservices on AWS · IaC-managed end-to-end
TypeScriptNode.jsAWS LambdaAPI GatewayRDSElastiCacheECS FargateDynamoDBTerraformReactVite
022025–2026
FinTech Real-time ML Event-driven

Cognitive Behavior Inference Engine

An AI/ML system for real-time detection of behavioral patterns and cognitive biases — loss-chasing, FOMO, overconfidence — in user activity, enabling intelligent intervention and decision support.

What I built

  • Event-driven pipeline (events → feature extraction → inference) with low-latency streaming
  • Redis-based feature store for live behavioral signals: timing, sequences, intensity, context-aware metrics
  • Gradient-boosted models (XGBoost, LightGBM) for behavioral pattern detection
  • Personal-baseline modeling: the system learns each user's signature first, then flags meaningful deviations from THAT
  • Synthetic data pipeline for training bootstrap and signal-quality validation
  • FastAPI inference service on ECS for scalable real-time predictions
  • Multi-level intervention layer (Guardian) with alerts, constraints, adaptive guidance

Architecture

User Events trades · clicks Kafka / Redpanda Redis FS feature store FastAPI on ECS XGBoost LightGBM Personal baseline JanusGraph cascade detect Guardian intervention
Streaming events → live features → real-time inference → Guardian intervention
PythonFastAPIXGBoostLightGBMKafkaRedpandaRedisJanusGraphECS
032025
Data Engineering Modern data stack

Crypto Market Analytics Pipeline

End-to-end data pipeline using the modern data stack — Python ingestion, BigQuery warehouse, dbt transformations, Airflow orchestration, all reproducible via Terraform.

What I built

  • Custom Python ingestion from CoinGecko API with retry, rate limiting, idempotent loading
  • BigQuery warehouse with partitioned/clustered tables — measured ~11× scan reduction on common analytical queries
  • Three-layer ELT (raw → staging → marts) with dimensional modeling: fact + dim tables
  • dbt transformations: incremental models, SCD Type 2 snapshots, data-quality tests (not_null, unique, accepted_range, source freshness)
  • Airflow DAG: ingestion, transformation, testing, freshness checks with failure alerting
  • Terraform for BigQuery datasets, tables, service accounts, IAM
  • Dockerized stack: custom Airflow image with dbt, docker-compose for full local environment

Pipeline

CoinGecko REST API Raw BigQuery Staging dbt Marts fact + dim BI analytics Airflow DAG · orchestration · alerting · freshness checks Terraform IaC · BigQuery datasets · IAM · service accounts
~11× scan reduction via partitioning & clustering
PythonBigQuerydbtAirflowAirbyteTerraformDockerGCP
042024
Research Tools Agentic AI Ozmosys partnership

Autonomous AI-Powered Research Assistant

An AI-driven research assistant that breaks down complex research tasks into structured steps — topic analysis, sub-question generation, live data collection, summarization, final synthesis — with every source transparently cited.

What I built

  • AI agent architecture for autonomous, multi-step research tasks (LangGraph-based planner)
  • Real-time web crawling with intelligent source filtering and provenance tracking
  • Hybrid-model inference for faster, bias-minimized analysis
  • Structured research reports tailored to analysts, journalists, policymakers, scientists
  • Next.js / React frontend with streaming responses
  • Python backend with ChromaDB for retrieval; factual-consistency guardrails

Agent loop

Topic user prompt Planner LangGraph Sub-questions decomposition Web Crawl filtered Retrieval ChromaDB Synthesis + guardrails replan if gaps
Planner decomposes → tools execute → synthesis evaluates → loop or finalize
PythonLangChainLangGraphChromaDBNext.jsReactSSE
052024
EdTech Computer Vision · OCR

AI-Powered Student Test Scoring Platform

Intelligent assessment system for elementary education that automatically grades scanned handwritten student tests — designed for offline classroom deployment.

What I built

  • OCR + geometric layout detection to separate static template from handwritten content
  • Region segmentation across multiple test types and variable layouts
  • Rule-based and pattern-based grading logic for handwritten answers
  • Designed for low-connectivity / offline classroom deployment — no cloud dependency
  • Reduced manual bias and ensured consistent evaluation across student submissions

Pipeline

Scanned paper test Layout detection Static template Handwritten content OCR + grading Score & feedback vs answer key
Layout detection isolates static vs. dynamic regions for accurate OCR
PythonOCRComputer VisionLayout Detection
062023–2024
LegalTech RAG · Llama-2 Ozmosys partnership

GPT-Based Legal Document Search & Knowledge Retrieval

A secure LLM-based knowledge retrieval system for law firms and legal departments — conversational search across large legal corpora with similarity-based vector retrieval.

What I built

  • Chat-based interface for document query and legal research
  • Similarity-based search across massive legal repositories (Llama-2 + ChromaDB embeddings)
  • Summarization, citation, and context-aware document retrieval via NLP and LLMs
  • Hybrid model routing for compliance-sensitive queries
  • Scalable infrastructure with data isolation on dedicated servers
  • Iterative refinement experience — attorneys refine queries with follow-up questions
PythonLlama-2ChromaDBLangChainReactNLP

Outcome

  • Reduced legal research time per case from hours to minutes
  • Improved precision via citation-grounded answers — attorneys see exactly which document and which passage support each claim
  • Compliance-safe routing kept regulated queries on approved models
  • Data isolation per client meant no cross-firm leakage in shared infrastructure
072022–2023
HealthTech HIPAA-compliant JoVE partnership

AI-Powered Medical Knowledge Chatbot

A secure, AI-powered medical chatbot enabling healthcare professionals to search complex medical knowledge bases via natural language and retrieve context-rich answers, document citations, and research summaries in seconds.

What I built

  • Vector search architecture: indexed thousands of medical documents (PDF, DOC, TXT) with HuggingFace embeddings, ChromaDB for similarity search
  • Web-based conversational chatbot interface with contextual follow-ups and dynamic literature filtering
  • HIPAA-compliant deployment on dedicated servers with client-specific data isolation
  • LangChain orchestration coordinating embeddings, search, and answer generation across models
  • Custom medical NLP pipeline: summarization, question rewriting, chunk-level similarity, citation tracking
PythonLangChainHuggingFaceChromaDBNLPHIPAA

Outcome

  • Healthcare professionals save hours of manual document review per query
  • Concise, evidence-based answers with full citations
  • Faster diagnosis support, treatment planning, and research workflows
  • Compliance-grade isolation eliminated cross-client data leakage risk
082021–2022
Scientific Publishing JoVE Backend Architecture

JoVE — Scientific Video Platform

Senior full-stack engineer on JoVE — the world's leading peer-reviewed scientific video journal and education platform, serving 1,800+ universities and 30,000+ peer-reviewed videos.

What I built

  • Designed initial backend architecture (Symfony + PostgreSQL)
  • Built REST APIs powering a Next.js frontend with SSR/SSG
  • Active backend development and frontend integration through 2022
  • Supported JoVE's product family: Journal, Encyclopedia of Experiments, Core, Science Education, Lab Manual, Business, Quiz
SymfonyPHPPostgreSQLNext.jsREST APISSR/SSG

Scale

  • 1,800+ universities and companies worldwide use the platform
  • 30,000+ peer-reviewed videos in the catalog
  • Multi-region content delivery with localization in 13 languages
  • Used across research, undergraduate education, K-12, and biopharma
092020
LegalTech · FinTech Ozmosys NLP · Re-architecture

AI-Powered Legal & Business Intelligence Platform

Re-engineered Ozmosys's award-winning legal data intelligence platform from the ground up — owning backend and frontend rebuild, NLP integration, QA, and project delivery.

What I built

  • Re-architected the platform for performance and scale
  • Integrated NLP for entity extraction from unstructured data — newsletters, emails, social feeds
  • AI-powered search with entity recognition, faceted filtering, and customizable alerts
  • REST API for CRM integrations
  • Streamlined development workflows and operational practices for long-term efficiency
PythonNLPEntity RecognitionREST APIRe-architecture

Context

  • Ozmosys is trusted by AmLaw 100 firms and global financial institutions
  • Platform aggregates content from LexisNexis, WestLaw, Bloomberg, and other top sources
  • Now trained on 1M+ partner-level memos for domain-specific precision
  • Modernized intelligence hub that eliminates information overload in high-stakes legal environments