How to configure aider and Continue with o3-mini and DeepSeek-R1 deployed in Azure AI Foundry

A step-by-step guide to configure aider and Continue with Azure-hosted o3-mini and DeepSeek-R1 LLMs for AI-assisted development

February 5, 2025 · 4 min

Clipit (previously Grabit) 0.7 released, and how to use an LLM to keep the README in sync with the code

In the meantime, I’ve renamed Grabit to Clipit to prevent PyPI naming collisions – see more details here. I’ve just released v0.7 of Grabit, my little command line app for saving full-text copies of webpages. It brings support for saving Reddit posts (I really wanted to do this), and custom user agents (I didn’t really want to do this, but here we are). It also prettifies the markdown to make sure it looks just the way it should; nobody likes 10 blank rows before every bullet point. ...

January 30, 2025 · 3 min

Building a simple agent with smolagents and Azure OpenAI

How to integrate smolagents with Azure OpenAI to build Python-driven AI agents. Also, lots of ducks.

January 20, 2025 · 5 min

Prompt Caching with Azure OpenAI

How Azure OpenAI’s prompt caching feature works, its benefits, caveats, and a quick experiment

January 12, 2025 · 9 min

Clipit (previously Grabit), the Web Page Downloader

A web page downloader for humans and large language models alike

January 7, 2025 · 2 min

(Better) Dependency Injection in FastAPI

A bit of a rant on the state of dependency injection in Python/FastAPI, and an implementation using the Injector and FastAPI-Injector libraries

December 15, 2024 · 5 min

Lessons Learned 2 - 8 December 2024

Interesting things I’ve learned in week 2 - 8 December 2024 (apart from the fact that democracy is fragile)

December 8, 2024 · 4 min

Vlad's Awesome Generative AI Compendium

Generative AI models I like

July 16, 2024 · 10 min

Fine-Tuning AI Models: Comparing the Costs of OpenAI vs Azure OpenAI

Understand the differences in pricing between Azure OpenAI and OpenAI for fine-tuning AI models, with a detailed analysis of token and hosting costs.

July 1, 2024 · 7 min

How I've Used Whisper to Transcribe, GPT-4 to Summarize, DALL·E to Illustrate, and Text-to-speech to Narrate OpenAI's DevDay Keynote

I heard you like OpenAI, so I used OpenAI’s Whisper to transcribe the OpenAI DevDay Keynote, OpenAI GPT-4 Turbo to summarize the transcript, come up with ideas that illustrate the main points and generate DALL-E prompts for said ideas, OpenAI DALL·E 3 to generate the images, and OpenAI Text to Speech to narrate the summary. Xzibit would be like, so proud.

February 10, 2024 · 14 min