Writings
Learnings from ingesting millions of technical pages for RAG on Azure.
Learnings from ingesting millions of technical pages for RAG on Azure. Context overview This document outlines insights of an engagement with one of our…
Showcase: Azure AI Hybrid Search unexpected results gotcha
Showcase: Azure AI Hybrid Search unexpected results gotcha This document describes a gotcha in Azure AI Search hybrid queries where unexpected results are…
Encoding hidden prompt in LLMs as potential attack vector.
The recent publication on LLM "sleeper agents" prompted me to re-explore influencing LLMs to pursue alternative objectives. In this case, without alterting…
What does a statement like “AI will take my job” look like in practice?
What does a statement like “AI will take my job” look like in practice? On two different occasions and coincidentally, I have been approached with the question…
GPT-4 CLI with persistence in 10 lines of code.
A short one: I needed a GPT-4 CLI interface (that's a RAS Syndrome), but most options seemed quite cluttered, and I like code-golf. Here an implementation of…
Voice record daily thoughts, redact with GPT4, and save to Apple Notes using Shortcuts.
The why Recording my daily thoughts and notes is a challenge I return to every few months, and so far, I hadn't figured out a system which works well for me. On…
Evaluating RAG/LLMs in highly technical settings using synthetic QA generation
In short: The RAG pattern for LLMs can be evaluated using QA pairs. Creating a "golden" dataset is expensive, but an auto-generated "silver" dataset can help…
On automating unit tests with LLMs.
Dear colleagues, I have a confession to make: I have been delegating some of my unit tests to my Jr. engineer, Gary-Pete Truman. First off, you might wonder why…
78% MNIST accuracy using GZIP in under 10 lines of code.
GZIP Addendum after hitting the HN frontpage: MNIST is a straightforward dataset, and higher accuracies are possible with various methods. The novelty of this…
From Concept to practice: Learnings from LLMs for Enterprise Production – Part 0
Disclaimer: I am an employee at Microsoft ISE. However, the views and opinions are of my own. We have recently engaged in an architecture design & upskilling…
One approach to achieving self-governing AI today
Note: I will not discuss the alignment issue or responsible AI / ethics. A self-governing AI is capable of solving any challenge in the digital domain within…
What defines great software: solving my problem in under 5 minutes with Tailscale
TL;DR: I had been avoiding properly setting up remote networking to my homeserver. Tailscale solved my problem in under 5 minutes. Today, at 4:30pm, I decided I…