Hi, I’m Vlad 👋

I’m a software & AI architect, founder, and Microsoft MVP on AI. I write and speak about machine learning in general and AI in particular. Follow me on Substack or plain old RSS.

Here are some of my highlights from the past decade:

  • Co-founded ZenAIos, where we build AI solutions for the public and private medical sector.
  • Co-founded NRGI.ai, a startup focused on forecasting energy prices and connecting small businesses with energy suppliers. As the technical co-founder, I was excited to know my price forecasts were used by big names such as Electrica and Hidroelectrica.
  • Partnered with some old friends and joined Strongbytes as Head of AI to try and create a kick-ass outsourcing company. Before that, I served as technical director for Maxcode, also focused on outsourcing.
  • Co-founded NDR – an AI conference, the first of its kind in Iași – along with the friends at Codecamp. Handled the agenda, scouted for speakers and MC’d every edition.

You’ll find some of the apps I’ve built on GitHub and on the Chrome Web Store, while videos of some of my favorite talks can be found on YouTube.

To get in touch, just pick your favorite social platform below and drop me a line 👋.

Pros and Cons of using a Model Router

Model routers for LLMs: when they shine, when they fail, how to evaluate them, and a simple starter approach

August 13, 2025 · 3 min

An Awesome List of AI Assisted Development Tools

A non-comprehensive but still awesome list of AI development tools – IDEs, extensions, CLIs, and asynchronous coding agents

August 12, 2025 · 6 min

How to configure aider and Continue with o3-mini and DeepSeek-R1 deployed in Azure AI Foundry

A step-by-step guide to configure aider and Continue with Azure-hosted o3-mini and DeepSeek-R1 LLMs for AI-assisted development

February 5, 2025 · 4 min

Grabit 0.7 released, and how to use an LLM to keep the README in sync with the code

I’ve just released v0.7 of Grabit, my little command line app for saving full-text copies of webpages. It brings support for saving Reddit posts (I really wanted to do this), and custom user agents (I didn’t really want to do this, but here we are). It also prettifies the markdown to make sure it looks just the way it should; nobody likes 10 blank rows before every bullet point. Using o1 to automate the boring parts: one more interesting thing is that I’m experimenting with using an LLM to help me keep the README in sync with the new changes, and in general help me automate the boring parts of releasing a new version. ...

January 30, 2025 · 3 min

Building a simple agent with smolagents and Azure OpenAI

How to integrate smolagents with Azure OpenAI to build Python-driven AI agents. Also, lots of ducks.

January 20, 2025 · 5 min

Prompt Caching with Azure OpenAI

How Azure OpenAI’s prompt caching feature works, its benefits, caveats, and a quick experiment

January 12, 2025 · 9 min

Grabit, the Web Page Downloader

A web page downloader for humans and large language models alike

January 7, 2025 · 2 min

(Better) Dependency Injection in FastAPI

A bit of a rant on the state of dependency injection in Python/FastAPI, and an implementation using the Injector and FastAPI-Injector libraries

December 15, 2024 · 5 min

Lessons Learned 2 - 8 December 2024

Interesting things I’ve learned in the week of 2 - 8 December 2024 (apart from the fact that democracy is fragile)

December 8, 2024 · 4 min

Vlad's Awesome Generative AI Compendium

Generative AI models I like

July 16, 2024 · 10 min