Vahid Faraji — Applied AI

Vahid Faraji

Senior Applied AI Specialist — Agentic Systems & Context Engineering

Senior Applied AI who designs agentic systems, context engineering workflows, and evaluation-driven LLM automation for enterprise use cases. 8+ years in analytics and data, expert on multi-agent architecture, retrieval pipelines, and cost-aware AI tooling that turn unstructured business tasks into reliable production systems. Hands on experience on enterprise data, agentic develop and cloud infra.

Creator of Tokalator: open-source infrastructure for context optimization and cost-aware agent execution across 15 models. Google-certified Generative AI Leader.

Agentic Systems Design Multi-Agent Design Product Engineer Context Engineering Token Economics Enterprise Data RAG Pipelines DB Vector MCP and Tool Call

GitHub LinkedIn Email tokalator.wiki

Try Tokalator Read Papers Download CV

01 / Signals

agent_arch

Multi-Agent Orchestration

Designed hierarchical agent frameworks (planner → specialist → reviewer → tool) deployed at 20M+ user scale

eval_loop

Evaluation-Driven AI

95% LLM accuracy, 86.2% rule-based segment accuracy, 0.94 avg confidence — measured on 14K+ positions

token_budget

Context & Cost Engineering

Built tokalator.wiki — open-source context optimization toolkit, VS Code extension & caching ROI calculators

prod_impact

Production Impact

30K+ job titles normalized, 72.5% faster classification, 60% fewer manual tasks — shipped to real users

02 / Work

a. Open Source

open source infrastructure

Tokalator — Context Engineering Toolkit

Open-source infrastructure for context optimization and cost-aware agent execution. (1) VS Code extension with real-time token budget monitoring, tab relevance scoring, and 11 chat commands; (2) web platform with Cobb-Douglas quality-of-output calculators; (3) reusable catalog of context engineering prompts, agents & instructions. Covers 15 models across Anthropic, OpenAI & Google with formal cost models, caching break-even analysis, and token allocation optimization.

arch: context_layer → budget_monitor → relevance_scorer → cost_optimizer → 15_model_coverage

Next.js·React 19· VS Code Extension·Context Engineering·Token Economics

Marketing Site web

VS Code Extension ext

b. Research

research ACM CAIS '26

Multi-Agent Position Classification

MCP-based multi-agent system for cross-lingual normalization of free-form job titles to O*NET & ESCO taxonomies. Root Content Agent delegates to specialized sub-agents (web search, content extraction, DB queries, trend analysis) with multi-layered memory. Evaluated on 14K+ positions — 72.5% reduction in classification time, 86.2% rule-based accuracy, 95% LLM accuracy (0.94 avg confidence). Accepted as demo paper at ACM CAIS '26.

arch: planner_agent → [search_agent | db_agent | extract_agent | trend_agent] → evaluator → taxonomy_output

MCP·LangGraph· O*NET·ESCO·Multi-Agent·Evaluation

Multi-agent architecture from ACM CAIS '26

Agent Architecture arch

Eval Dashboard eval

c. Enterprise

production enterprise

Enterprise AI Transformation

Agentic workflows across Sales (report automation), Finance (text-to-SQL), and R&D — designed, evaluated, and handed off to enterprise operations. AI-powered feedback automation serving 100+ users with 60% reduction in manual tasks. Token-efficient prompt and context engineering adopted as org standard.

arch: task_decomposer → specialist_agents → human_checkpoint → eval_loop → enterprise_handoff

FastAPI·LangGraph· Agno·Pydantic·Arize

production

Text-to-SQL — Natural Language Reporting

Natural-language to SQL pipeline turning business questions into structured reports. RAG-augmented query generation agent with schema grounding, guardrails for query safety, and human escalation for ambiguous inputs.

arch: nl_parser → schema_rag → sql_generator → validator → report_renderer

Python·SQL· RAG·BigQuery·Azure Synapse

production

Self-Learning Agents

Autonomous agents that improve through feedback loops — evaluation-first design with Arize observability, LangSmith tracing, structured failure recovery, and controlled autonomy boundaries.

arch: executor → outcome_evaluator → feedback_store → self_refinement → human_gate

LangGraph·MCP· Agno·Arize·LangSmith

03 / Publications

ACM CAIS '26 demo paper

Tool-Augmented Multi-Agent Systems for Job Position Normalisation

Vahid Faraji et al. · Kariyer.net R&D · ACM CAIS 2026 · Accepted

arXiv 2601.22885

Leveraging LLMs For Turkish Skill Extraction

Vahid Faraji et al. · arXiv preprint · 2026

arXiv 2604.08290

Tokalator: A Context Engineering Toolkit for AI Coding Assistants

Vahid Faraji · arXiv preprint · 2026

04 / Talks & Media

Bridge Summit talk

Kayseri University talk

Van University talk

05 / Experience

2025 — Present

Senior Applied AI Specialist

Kariyer.net

Designed agentic reporting pipelines (Sales automation, Finance text-to-SQL) and implemented evaluation loops for LLM-powered workflows. Introduced token-efficient prompt and context engineering standards adopted org-wide. AI feedback automation serving 100+ users, 60% reduction in manual tasks.

↑ Google GenAI Leader · Perplexity ilab $40K · ACM CAIS '26 first author

2022 — 2025

Senior Data Product Manager

Kariyer.net

Built AI-powered search, agentic workflows, and cost optimization systems for Turkey's largest job platform. Led cross-functional teams delivering data products to 20M+ users. Normalized 30K+ job titles into O*NET & ESCO taxonomies — replaced 6 months of manual work.

↑ 20M+ users · 30K+ positions normalized · agentic search shipped

2021 — 2022

Business Analyst

WorqCompany

Financial modelling, risk logic, and KPI design for HR-tech products.

2019 — 2021

Co-Founder

Defaro.io

Labor market analytics startup — built the data pipeline and product strategy from scratch.

06 / Certifications

Google Dec 2025 — Dec 2028

Generative AI Leader

Generative AI, Google Gemini, AI strategy & leadership.

LangChain Jan 2025

Introduction to LangSmith

LLM observability, tracing, evaluation and debugging with LangSmith.

Techcareer.net 2026

Uygulamalı AI Araçlar, Agent'lar ve Otomasyon

Practical AI tools, agents and automation — agentic workflows and enterprise AI delivery.

07 / Tech Stack

Agent Systems

LangGraph / MCP / Agno
OpenAI Agents SDK
LangChain
Agent Eval & Benchmarking
Observability (Arize, LangSmith)
Guardrails & Safety
Agent Garden (Google)

Languages

Python (FastAPI, Async)
TypeScript
SQL / Text-to-SQL
Pydantic

Data & Retrieval

PostgreSQL / Supabase
Azure Synapse / BigQuery
Vector DBs / RAG
ETL / Data Ingestion
DataHub
Metadata Management

Automation

n8n
Power Automate
GitHub Copilot Workflows

AI Dev Kit

Docker
Azure / Google Cloud
Next.js / React (Vercel, V0)
VS Code, Claude Code

08 / Now

~ Currently Building

├── Long-horizon agent reliability patterns
├── Structured evaluation frameworks for agentic systems
├── Cost-aware multi-agent orchestration at enterprise scale
└── Tokalator v2 — expanded model coverage + agent-native context APIs

~ Currently Reading / Shipping

├── Agent eval benchmarks beyond single-turn accuracy
├── Prompt caching break-even economics across frontier models
├── Production traces: what trustworthy autonomy looks like
└── Open-weights vs. hosted cost curves for agentic workloads