All Stories

GPU-accelerated ML inference in Vespa Cloud

Today we're introducing support for GPU-accelerated ONNX model inference in Vespa, together with support for GPU instances in Vespa Cloud!

Improving Search Ranking with Few-Shot Prompting of LLMs

Distilling the knowledge and power of generative Large Language Models (LLMs) with billions of parameters to ranking models with a few million parameters.

Vespa Newsletter, January 2023

Advances in Vespa features and performance include better tensor formats, AWS PrivateLink, autoscaling, data plane access control, and container and content node performance.

Improving Zero-Shot Ranking with Vespa Hybrid Search - part two

Where should you begin if you plan to implement search functionality but have not yet collected data from user interactions to train ranking models?

Improving Zero-Shot Ranking with Vespa Hybrid Search

If you are planning to implement search functionality but have not yet collected data from user interactions to train ranking models, where should you begin?

Improving Product Search with Learning to Rank - part three

This is the third blog post on applying learning to rank to enhance E-commerce search.

Vespa Newsletter, November 2022

Vespa features and performance advances include ANN Pre-Filter Performance, Parent Field Hit-Estimates, Model Training Notebooks, and GCP Support.

Improving Product Search with Learning to Rank - part two

This is the second blog post on applying learning to rank to enhance E-commerce search.

Vespa Cloud on Google Cloud Platform

Vespa Cloud is now available on Google Cloud Platform.

Improving Product Search with Learning to Rank - part one

This is the first blog post on applying learning to rank to enhance E-commerce search.

Vespa Newsletter, October 2022

Advances in Vespa features and performance include a BertBase Embedder / model hub, improved query performance, paged attributes, ARM64 support and term bolding in string arrays.

Building Billion-Scale Vector Search - part two

Searching billion-scale datasets without breaking the bank.