Writings

Learnings from ingesting millions of technical pages for RAG on Azure.

2024-05-06

Learnings from ingesting millions of technical pages for RAG on Azure. Context overview This document outlines insights of an engagement with one of our…

Showcase: Azure AI Hybrid Search unexpected results gotcha

2024-04-30

Showcase: Azure AI Hybrid Search unexpected results gotcha This document describes a gotcha in Azure AI Search hybrid queries where unexpected results are…

Encoding hidden prompt in LLMs as potential attack vector.

2024-01-15

The recent publication on LLM "sleeper agents" prompted me to re-explore influencing LLMs to pursue alternative objectives. In this case, without alterting…

What does a statement like “AI will take my job” look like in practice?

2023-11-21

What does a statement like “AI will take my job” look like in practice? On two different occasions and coincidentally, I have been approached with the question…

GPT-4 CLI with persistence in 10 lines of code.

2023-11-03

A short one: I needed a GPT-4 CLI interface (that's a RAS Syndrome), but most options seemed quite cluttered, and I like code-golf. Here an implementation of…

Voice record daily thoughts, redact with GPT4, and save to Apple Notes using Shortcuts.

2023-11-02

The why Recording my daily thoughts and notes is a challenge I return to every few months, and so far, I hadn't figured out a system which works well for me. On…

Evaluating RAG/LLMs in highly technical settings using synthetic QA generation

2023-10-26

In short: The RAG pattern for LLMs can be evaluated using QA pairs. Creating a "golden" dataset is expensive, but an auto-generated "silver" dataset can help…

On automating unit tests with LLMs.

2023-10-19

Dear colleagues, I have a confession to make: I have been delegating some of my unit tests to my Jr. engineer, Gary-Pete Truman. First off, you might wonder why…

78% MNIST accuracy using GZIP in under 10 lines of code.

2023-09-20

GZIP Addendum after hitting the HN frontpage: MNIST is a straightforward dataset, and higher accuracies are possible with various methods. The novelty of this…

From Concept to practice: Learnings from LLMs for Enterprise Production – Part 0

2023-08-18

Disclaimer: I am an employee at Microsoft ISE. However, the views and opinions are of my own. We have recently engaged in an architecture design & upskilling…

One approach to achieving self-governing AI today

2023-02-06

Note: I will not discuss the alignment issue or responsible AI / ethics. A self-governing AI is capable of solving any challenge in the digital domain within…

What defines great software: solving my problem in under 5 minutes with Tailscale

2022-05-04

TL;DR: I had been avoiding properly setting up remote networking to my homeserver. Tailscale solved my problem in under 5 minutes. Today, at 4:30pm, I decided I…