Most Recent Posts
Vector Databases Explained: Search in era of AI - 16 March 2025
2024
June 2024
A Guide to LLM Inference (Part 4): Speculative Decoding & Batching - 2 June 2024
May 2024
A Guide to LLM Inference (Part 3): Model Compression - 19 May 2024
A Guide to LLM Inference (Part 2): Attention Optimisation - 5 May 2024
April 2024
A Guide to LLM Inference (Part 1): Foundations - 21 April 2024
A brief Introduction to LLMOps - 7 April 2024
March 2024
Fine-Tuning Pre Trained Models - 24 March 2024
An Introduction to the Transformer Architecture (Part 2) - 10 March 2024
February 2024
An Introduction to the Transformer Architecture (Part 1) - 25 February 2024
Hello World - 11 February 2024