Category: Uncategorized
-
You might not have heard of the term foundation model before, but you’ve almost certainly used one. In reference to LLMs (and AI in general), “foundation model” refers to a model that has been trained on vast swathes of data, such that it can be used in a general context. LLM foundation models are those…
-
RAG, or retrieval-augmented generation, is a technique that allows LLMs to access external sources of data. Normally, LLMs can only rely on the prompt fed to them, and the knowledge baked into their parameters. However, RAG vastly expands LLMs’ capabilities. Using these external sources of data, an LLM can incorporate it into its output. This…
-
What is Transfer Learning? Transfer learning is a machine learning technique where a model trained on one task is reused as a starting point for a different but related task. Instead of building and training a new model from scratch for every problem, transfer learning leverages the knowledge and features learned by a pre-trained model…
-
LoRA in a Nutshell Low-Rank Adaptation, or LoRA, is a cutting-edge technique designed to fine-tune large language models (LLMs) efficiently and effectively. Instead of modifying the entire massive model, LoRA adapts just a small fraction of parameters through lightweight additions, enabling rapid specialization without retraining from scratch or requiring excessive computational resources. The Core Idea:…
-
With all the buzz around OpenAI’s new Sora 2 video generation model, you might be wondering what makes it different from other previous SOTA models like Veo 3. Here’s the breakdown. Visual fidelity: Sora 2 has made improvements in visual fidelity, generating frames natively in 720p and then upscaling, to maintain sharp textures and object…
-
Core Building Blocks Why Agentic AI?
-
Agentic AI refers to AI systems that don’t just predict the next token or classify inputs – they perceive, plan, and act to achieve goals over time. Instead of passively answering questions, agentic systems take initiative: they break down objectives into steps, call the right tools and services, monitor progress, adapt to feedback, and iterate…
-
With the recent release of Genie 3, attention has once again been brought to world models. These are generative models that are trained to simulate and model the (not necessarily) real world! Most impressively, these are real-time simulations, sensitive to input. World models are able to understand the properties of the simulated world – such…
-
Have you seen this video? “This is gonna be scariest sound you’ll hear when they’re looking for you” “This is almost like two R2-D2’s having a conversation.” “Literally sounds like something out of a sci-fi horror with the AI looking for you hiding in the cupboard lol” What’s going on? Is Skynet upon us? What…
-
Why Do LLMs Hallucinate? LLM hallucinations stem from several inherent factors tied to how these models are developed and operate: Approaches to Mitigate Hallucinations While hallucinations cannot be entirely eliminated, various strategies help reduce their frequency and impact: