Posts by Jo Kristian Bergum
Elasticsearch vs Vespa Performance Comparison
Detailed report of a comprehensive performance comparison between Vespa and Elasticsearch for an e-commerce search application.
Scaling ColPali to billions of PDFs with Vespa
Scaling Vision-Driven Document Retrieval with ColPali to large collections.
Beyond Text: The Rise of Vision-Driven Document Retrieval for RAG
Imagine a world where search engines see documents with human-like vision.
Small but Mighty: Using Answer.ai's ColBERT embedding model in Vespa
Using Answer.ai's ColBERT-small model for efficient and effective passage search
PDF Retrieval with Vision Language Models
Connecting the ColPali model with Vespa for complex document format retrieval.
Adaptive In-Context Learning 🤝 Vespa - part one
Adaptive In-Context Learning (ICL) with Vespa to retrieve context-sensitive examples
Improving retrieval with LLM-as-a-judge
How to create your own reusable retrieval evaluation dataset for your data and use it to assess your retrieval system's effectiveness
Matryoshka 🤝 Binary vectors: Slash vector search costs with Vespa
Announcing Matryoshka (dimension flexibility) and binary quantization in Vespa, and how these features slash costs.
Perspectives on R in RAG
In this blog post, I share perspectives on the R in RAG.
Scaling vector search using Cohere binary embeddings and Vespa
Three comprehensive guides to using the Cohere Embed v3 binary embeddings with Vespa.
Announcing Vespa Long-Context ColBERT
Announcing long-context ColBERT, giving it larger context for scoring and simplifying long-document RAG applications.
Announcing the Vespa ColBERT embedder
Announcing the native ColBERT embedder in Vespa, enabling explainable semantic search using token-level vector representations.
Redefining Hybrid Search Possibilities with Vespa - part one
This is the first blog post in a series on hybrid search. It focuses on efficient hybrid retrieval and representational approaches in IR.
Turbocharge RAG with LangChain and Vespa Streaming Mode for Sharded Data
A hands-on guide to connecting LangChain with Vespa streaming mode to build cost-efficient RAG applications over naturally sharded data.
🎄Advent of Tensors 2023 🎅
Prepare to embark on a festive journey as we bring you the Advent of Tensors!
Hands-On RAG guide for personal data with Vespa and LlamaIndex
A hands-on guide to using Vespa streaming mode with PyVespa and LlamaIndex.
Announcing search.vespa.ai
A new search experience for Vespa-related content, powered by Vespa, LangChain, and OpenAI’s ChatGPT model: our motivation for building it, its features, and its limitations.
Representing BGE embedding models in Vespa using bfloat16
This post demonstrates how to use the recently announced BGE embedding models in Vespa. We evaluate the effectiveness of two BGE variants on the BEIR trec-covid dataset.
Accelerating Transformer-based Embedding Retrieval with Vespa
In this post, we’ll see how to accelerate embedding inference and retrieval with little impact on quality, taking a holistic approach.
Simplify Search with Multilingual Embedding Models
This blog post shows how to represent a robust multilingual embedding model from the E5 family in Vespa.
Enhancing Vespa’s Embedding Management Capabilities
We are thrilled to announce significant updates to Vespa’s support for inference with text embedding models that map text into vector representations.
Minimizing LLM Distraction with Cross-Encoder Re-Ranking
Announcing global-phase re-ranking support in Vespa, unlocking efficient re-ranking with precise cross-encoder models. Cross-encoder models minimize distraction in retrieval-augmented completions generated by Large Language Models.
Customizing Reusable Frozen ML-Embeddings with Vespa
Deep-learned embeddings are popular for search and recommendation use cases. This post introduces the concept of using reusable frozen embeddings and tailoring them with Vespa.
Revolutionizing Semantic Search with Multi-Vector HNSW Indexing in Vespa
Announcing multi-vector indexing support in Vespa, which allows you to index multiple vectors per document and retrieve documents by the closest vector in each document.
Improving Search Ranking with Few-Shot Prompting of LLMs
Distilling the knowledge and power of generative Large Language Models (LLMs) with billions of parameters to ranking models with a few million parameters.
Improving Zero-Shot Ranking with Vespa Hybrid Search - part two
Where should you begin if you plan to implement search functionality but have not yet collected data from user interactions to train ranking models?
Improving Zero-Shot Ranking with Vespa Hybrid Search
If you are planning to implement search functionality but have not yet collected data from user interactions to train ranking models, where should you begin?
Improving Product Search with Learning to Rank - part three
This is the third blog post on applying learning to rank to enhance E-commerce search.
Improving Product Search with Learning to Rank - part two
This is the second blog post on applying learning to rank to enhance E-commerce search.
Improving Product Search with Learning to Rank - part one
This is the first blog post on applying learning to rank to enhance E-commerce search.
Building Billion-Scale Vector Search - part two
Searching billion-scale datasets without breaking the bank.
Building Billion-Scale Vector Search - part one
How fast is fast? Many consider the blink of an eye, around 100-250ms, to be plenty fast.
Will new vector databases dislodge traditional search engines?
Doug Turnbull asks an interesting question on LinkedIn: will new vector databases dislodge traditional search engines?
Managed Vector Search using Vespa Cloud
This blog post describes how your organization can unlock the full potential of multimodal AI-powered vector representations using Vespa, the industry-leading open-source big data serving engine.
Billion-scale vector search using hybrid HNSW-IF
This blog post describes HNSW-IF, a cost-efficient solution for high-accuracy vector search over billion-scale vector datasets.
Query Time Constrained Approximate Nearest Neighbor Search
This blog post describes Vespa's industry-leading support for combining approximate nearest neighbor search, or vector search, with query constraints to solve real-world search and recommendation problems.
Billion-scale vector search with Vespa - part two
Part two in a blog post series on billion-scale vector search with Vespa. This post explores the many trade-offs related to nearest neighbor search.
Billion-scale vector search with Vespa - part one
Part one in a blog post series on billion-scale vector search. This post covers using nearest neighbor search with compact binary representations and bitwise Hamming distance.
Result diversification using Vespa result grouping
This blog post dives into how to achieve result diversification using Vespa's grouping framework.
Pretrained Transformer Language Models for Search - part 4
This is the fourth blog post in a series of posts where we introduce using pretrained Transformer models for search and document ranking with Vespa.ai.
Pretrained Transformer Language Models for Search - part 3
This is the third blog post in a series of posts where we introduce using pretrained Transformer models for search and document ranking with Vespa.ai.
Pretrained Transformer Language Models for Search - part 2
This is the second blog post in a series of posts where we introduce using pretrained Transformer models for search and document ranking with Vespa.ai.
Pretrained Transformer Language Models for Search - part 1
This is the first blog post in a series of posts where we introduce using pretrained Transformer models for search and document ranking with Vespa.ai.
Using approximate nearest neighbor search to find similar products
A demonstration of approximate nearest neighbor search using the Amazon product dataset.
E-commerce search and recommendation with Vespa.ai
Holiday shopping season is upon us, and it’s time for a blog post on e-commerce search and recommendation using Vespa.ai.