Tools I use in production — not tools I have a partnership with
Every technology listed here has been deployed in a production system. I am vendor-agnostic by design — the right tool depends on your use case, your data, and your budget. No partnership deals influence my recommendations.
Industry-recognized credentials demonstrating expertise
Scrum.org
Product ownership and value maximization in Scrum
Issued 2019

Scrum Alliance
Agile facilitation and Scrum framework mastery
Issued 2018

Scaled Agile
Scaled Agile Framework for enterprise transformation
Issued 2021

Product School
Building and managing AI-powered products
Issued 2023

DeepLearning.AI
Neural networks, CNNs, RNNs, and transformers
Issued 2022
No vendor partnerships influence my recommendations. I choose the model that fits your latency, cost, and accuracy requirements.
Every tool here has been deployed in a system that handles real traffic. Lab-tested tools do not make this list.
Most AI projects overspend on infrastructure by 3-5x. I right-size from the start — smaller models, smarter caching, efficient inference.
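Much of that right-sizing comes from not paying for the same completion twice. A minimal sketch of prompt-level response caching in plain Python, where `call_model` is a hypothetical stand-in for any real provider client:

```python
import hashlib
import json

class PromptCache:
    """Cache completions keyed by a hash of (model, prompt, sampling params)."""

    def __init__(self):
        self._store = {}

    def _key(self, model, prompt, **params):
        # Deterministic key: identical requests hash to the same digest.
        payload = json.dumps({"model": model, "prompt": prompt, **params}, sort_keys=True)
        return hashlib.sha256(payload.encode()).hexdigest()

    def get_or_call(self, fn, model, prompt, **params):
        key = self._key(model, prompt, **params)
        if key not in self._store:
            self._store[key] = fn(model, prompt, **params)
        return self._store[key]

# Usage with a stand-in model call (swap in a real provider client here):
calls = []
def call_model(model, prompt, **params):
    calls.append(prompt)
    return f"echo: {prompt}"

cache = PromptCache()
cache.get_or_call(call_model, "small-model", "hello", temperature=0.0)
cache.get_or_call(call_model, "small-model", "hello", temperature=0.0)
assert len(calls) == 1  # second identical request served from cache
```

Only makes sense for deterministic settings (temperature 0); production systems add eviction and a shared store, but the cost lever is the same.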
AI models change every quarter. My architectures abstract the model layer so you can swap providers without rewriting your application.
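Swapping providers without a rewrite comes down to a single seam in the code. A minimal sketch of that seam in plain Python, where `ChatProvider` and `EchoProvider` are hypothetical names, not any vendor's SDK:

```python
from typing import Protocol

class ChatProvider(Protocol):
    """The only interface application code is allowed to depend on."""
    def complete(self, prompt: str) -> str: ...

class EchoProvider:
    """Stand-in implementation; a real one would wrap a vendor SDK."""
    def complete(self, prompt: str) -> str:
        return f"echo: {prompt}"

def summarize(provider: ChatProvider, text: str) -> str:
    # Application logic never imports a vendor SDK directly,
    # so swapping providers means changing one constructor call.
    return provider.complete(f"Summarize: {text}")

print(summarize(EchoProvider(), "quarterly report"))
# prints "echo: Summarize: quarterly report"
```

The same idea is what gateway libraries in the list below productize: one request format in front, any model behind it.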
Book a 30-minute call. I will assess your requirements and recommend the right combination of models, infrastructure, and frameworks — with cost estimates.
Every tool I evaluate, deploy, or recommend — with honest assessments.
Anthropic
Most capable Claude model — complex reasoning, long-context analysis, agentic tasks.
Official docs →

Anthropic
Best balance of intelligence and speed for production workloads.
Official docs →

Anthropic
Fastest and lowest-cost Claude model for high-volume tasks.
Official docs →

Anthropic
AI-native CLI for agentic software engineering — reads, writes, and runs code autonomously.
Official docs →

Anthropic
Open protocol connecting AI assistants to external tools, data sources, and services.
Official docs →

Anthropic
Build, orchestrate, and deploy multi-agent systems powered by Claude.
Official docs →

Mistral AI
Top-tier reasoning model with 128K context — Mistral's flagship for enterprise tasks.
Official docs →

Mistral AI
Cost-efficient multimodal model — text and image understanding.
Official docs →

Mistral AI
Apache 2.0 multilingual model — EU-sovereign deployments, 128K context.
Official docs →

Mistral AI
Code generation specialist — 80+ languages, fill-in-the-middle, 32K context.
Official docs →

Mistral AI
Frontier vision-language model — document analysis, chart reading, 128K context.
Official docs →

Mistral AI
High-quality text embeddings for RAG and semantic search.
Official docs →

Mistral AI
Train and own frontier AI model weights outright — no API rental, full data sovereignty.
Official docs →

Mistral AI
Enterprise AI assistant — SSO, audit logs, EU data residency, web search, document upload.
Official docs →

Meta
Meta's flagship open-weight model — Apache 2.0, matches GPT-4 on many benchmarks at a fraction of the cost.
Official docs →

Meta
Lightweight Llama models for mobile, edge, and on-device inference.
Official docs →

Meta
Vision-language Llama models — image understanding, document analysis.
Official docs →

Google
Google's open-weight family — Apache 2.0, strong reasoning, multilingual, edge-to-server range.
Official docs →

Microsoft
MIT-licensed reasoning specialist — outperforms models 3× larger on math and coding.
Official docs →

Microsoft
Edge-optimised reasoning model — 3.8B parameters, strong instruction following on constrained hardware.
Official docs →

Alibaba
Alibaba's Apache 2.0 multilingual family — exceptional Chinese/English, strong math, full size range.
Official docs →

Alibaba
State-of-the-art open-source code generation — rivals GPT-4o on coding benchmarks.
Official docs →

DeepSeek
MIT-licensed reasoning specialist with chain-of-thought — matches o1 on math and science tasks.
Official docs →

DeepSeek
671B MoE open-weight general model — top open-source benchmark scores across all categories.
Official docs →

TII UAE
TII's Apache 2.0 family — strong multilingual performance, designed for EU/MENA sovereign deployments.
Official docs →

Hugging Face
Ultra-compact models for on-device and browser inference — Apache 2.0, efficiency benchmark.
Official docs →

Ollama
One-command local model serving — runs Llama, Mistral, Gemma and 100+ models on any hardware.
Official docs →

vLLM Project
High-throughput production LLM serving — PagedAttention, continuous batching, OpenAI-compatible.
Official docs →

Hugging Face
Hugging Face's production inference server — tensor parallelism, quantization, streaming.
Official docs →

ggerganov
CPU/GPU inference in C++ — GGUF format, runs on Apple Silicon, NVIDIA, AMD, CPU-only.
Official docs →

LM Studio
Desktop GUI for discovering, downloading, and running local LLMs — OpenAI-compatible server.
Official docs →

Hugging Face
Run Transformers in the browser and Node.js — ONNX-based, no server required.
Official docs →

Microsoft
Cross-platform optimised inference — CPU, GPU, mobile, browser, WASM support.
Official docs →

BerriAI
Universal LLM API proxy — call 100+ models with OpenAI format, load balancing, fallbacks.
Official docs →

Unsloth AI
2× faster fine-tuning, 70% less VRAM — LoRA and QLoRA for Llama, Mistral, Qwen, Gemma.
Official docs →

OpenAccess AI Collective
Production fine-tuning framework — YAML config, LoRA/QLoRA/full, multi-GPU, Flash Attention.
Official docs →

hiyouga
Fine-tune 100+ LLMs with a web UI or CLI — SFT, DPO, GRPO, LoRA, QLoRA.
Official docs →

PyTorch
PyTorch-native fine-tuning library — recipe-based, minimal dependencies, full control.
Official docs →

Hugging Face
Parameter-Efficient Fine-Tuning — LoRA, QLoRA, IA³, AdaLoRA, Prefix Tuning.
Official docs →

Hugging Face
Transformer Reinforcement Learning — SFT, DPO, GRPO, PPO, ORPO for alignment training.
Official docs →

Microsoft
ZeRO optimizer for large model training — 10× throughput, trillion-parameter scale.
Official docs →

Hugging Face
One-line multi-GPU and TPU training — no code changes, FSDP and DeepSpeed integration.
Official docs →

NVIDIA
NVIDIA's large-scale pre-training framework — tensor/pipeline/sequence parallelism.
Official docs →

Hugging Face
900K+ models, 100K+ datasets, and Spaces — the de facto standard for AI artifact sharing.
Official docs →

Hugging Face
Core model library — load, run, and fine-tune any model in PyTorch, TensorFlow, or JAX.
Official docs →

Hugging Face
100K+ datasets with streaming, arrow-based loading, and one-line preprocessing.
Official docs →

Hugging Face
Managed dedicated or serverless model deployment — auto-scaling, private endpoints.
Official docs →

Hugging Face
No-code fine-tuning for LLMs and other models — SFT, DPO, classification, NER.
Official docs →

Hugging Face
Host Gradio and Streamlit ML demos — free tier available, GPU-enabled options.
Official docs →

Hugging Face
Standardised metrics library — BLEU, ROUGE, accuracy, F1, and 100+ custom metrics.
Official docs →

Hugging Face
Programmatic Hub access — upload models, create repos, manage tokens, search.
Official docs →

LangChain
LLM application framework — chains, agents, RAG, tool use, memory.
Official docs →

LlamaIndex
Data framework for LLM apps — ingestion, indexing, querying over any data source.
Official docs →

deepset
Production NLP pipeline framework — RAG, document search, question answering.
Official docs →

Stanford NLP
Declarative LLM programming — optimise prompts and weights automatically.
Official docs →

Jason Liu
Structured output extraction — Pydantic schemas from any LLM, with validation and retries.
Official docs →

Microsoft
Enterprise LLM orchestration for .NET, Python, Java — plugins, planners, memory.
Official docs →

CrewAI
Role-based multi-agent orchestration — agents collaborate with defined roles and goals.
Official docs →

Microsoft
Microsoft's multi-agent conversation framework — async agents, human-in-the-loop.
Official docs →

Hugging Face
Minimal agentic framework — code-first agents that write and execute Python, 1000-line core.
Official docs →

Qdrant
Rust-based vector search — on-prem friendly, filterable, sparse+dense hybrid search.
Official docs →

Weaviate
GraphQL API vector database — multi-tenancy, hybrid search, generative search.
Official docs →

Chroma
Local-first open-source vector database — Python-native, zero infrastructure required.
Official docs →

Zilliz
Distributed vector search for billion-scale data — HNSW, IVF, GPU acceleration.
Official docs →

PostgreSQL
Vector similarity search extension for PostgreSQL — no separate infrastructure needed.
Official docs →

Pinecone
Managed cloud vector database — serverless tier, namespaces, metadata filtering.
Official docs →

Pollen Robotics
Open-source humanoid robot for research and industry — Apache 2.0, ROS2, Python SDK.
Official docs →

Open Robotics
Robot Operating System 2 — real-time communication, sensor fusion, navigation stack.
Official docs →

Hugging Face
Open-source robot learning — imitation learning, reinforcement learning, pre-trained policies.
Official docs →

NVIDIA
Robot simulation and deployment platform — synthetic data generation, physics simulation.
Official docs →

OpenCV
Computer vision library — 2500+ algorithms, real-time image processing, widely deployed.
Official docs →

Meta
Segment Anything Model 2 — real-time video and image segmentation, zero-shot.
Official docs →

Ultralytics
Real-time object detection — fastest production-grade detector, ONNX/CoreML export.
Official docs →

Amazon
Managed foundation model APIs on AWS — Claude, Llama, Mistral, Titan, Stable Diffusion.
Official docs →

Microsoft
Microsoft's enterprise AI platform — model catalog, fine-tuning, responsible AI tools.
Official docs →

Google Cloud
GCP's unified AI/ML platform — Gemini, model garden, AutoML, feature store.
Official docs →

Cloudflare
Run AI models at the edge globally — Workers AI, 100+ models, serverless inference.
Official docs →

LangChain
LLM observability and tracing — log runs, compare prompts, regression testing.
Official docs →

Weights & Biases
ML experiment tracking, visualisation, and hyperparameter sweeps — industry standard.
Official docs →

Databricks
ML lifecycle management — experiment tracking, model registry, deployment.
Official docs →

CNCF / Grafana Labs
Inference metrics collection and dashboards — latency, throughput, error rates.
Official docs →

Arize AI
LLM evaluation and monitoring — hallucination detection, embeddings visualisation, drift.
Official docs →

Exploding Gradients
RAG evaluation framework — faithfulness, answer relevancy, context precision metrics.
Official docs →