{ }
Blog
Vespa.ai
Docs
Subscribe
Vespa Blog
We Make AI Work
Featured
Introducing layered ranking for RAG applications
Introducing layered ranking: The missing piece for context engineering at scale.
Perplexity builds AI Search at scale on Vespa.ai
Perplexity chose to build on Vespa.ai to provide the world's most used RAG application.
All Stories
Image from Gerd Altmann on Pixabay.
Eliminating the Precision–Latency Trade-Off in Large-Scale RAG
A look at three techniques that together eliminate this trade-off: multiphase ranking, layered retrieval and semantic chunking.
Image from Overearth on Shutterstock.
Why AI Search Platforms Are Gaining Attention
Users expect search not just to return accurate results, but to do the heavy lifting: Answer a question, summarize research, or even solve a problem.
Photo by
Ashraful Islam
on
Unsplash
Why Life Sciences AI Is a Search Problem (Part 5 of 5)
The future of GenAI in pharma and healthcare isn’t about building bigger models — it’s about smarter retrieval.
Photo by
Ousa Chea
on
Unsplash
Why Life Sciences AI Is a Search Problem (Part 4 of 5)
The future of GenAI in pharma and healthcare isn’t about building bigger models — it’s about smarter retrieval.
Vespa Newsletter, December 2025
Advances in Vespa features and performance include automated ANN tuning, accelerated vector distance calculations with Google Highway, precise chunk-level matching with enhanced sameElement, and expressive proximity queries with nested NEAR...
Photo by
National Cancer Institute
on
Unsplash
Why Life Sciences AI Is a Search Problem (Part 3 of 5)
The future of GenAI in pharma and healthcare isn’t about building bigger models — it’s about smarter retrieval.
Photo by
CHUTTERSNAP
on
Unsplash
Why Life Sciences AI Is a Search Problem (Part 2 of 5)
The future of GenAI in pharma and healthcare isn't about building bigger models - it's about smarter retrieval.
Photo by
Louis Reed
on
Unsplash
Why Life Sciences AI Is a Search Problem (Part 1 of 5)
The future of GenAI in pharma and healthcare isn't about building bigger models - it's about smarter retrieval.
Image from limpreom on Shutterstock
Beyond Vector Search: The Move to Tensor-Based Retrieval
Tensors preserve critical context, making them far better suited for advanced retrieval tasks where precision and explainability matter.
Image from Khanthachai C on Shutterstock.
Vector Search Is Reaching Its Limit. Here’s What Comes Next
As RAG applications evolve, they require richer data representations that capture relationships within and across modalities, like text, images and video.
🎄 Advent of Tensors 2025 🎅
We’re excited to announce Advent of Tensors 2025 — a 24-day coding challenge for anyone curious about tensors.
LLMs, Vespa, and a side of Summer Debugging
We built a standalone MCP server in Python, then rewrote it in Java for full Vespa integration — and lived to tell the tale.
« Prev
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
Next »
Never miss a
story
from us, subscribe to our newsletter
Subscribe