All Stories
Photo by Sandro Katalina on Unsplash
GPU-accelerated ML inference in Vespa Cloud
Today we're introducing support for GPU-accelerated ONNX model inference in Vespa, together with support for GPU instances in Vespa Cloud!
Photo by Maxime VALCARCE on Unsplash
Improving Search Ranking with Few-Shot Prompting of LLMs
Distilling the knowledge and power of generative Large Language Models (LLMs) with billions of parameters to ranking models with a few million parameters.
Photo by Scott Graham on Unsplash
Vespa Newsletter, January 2023
Advances in Vespa features and performance include Better Tensor formats, AWS PrivateLink, Autoscaling, Data Plane Access Control and Container and Content Node Performance.
Photo by Tamarcus Brown on Unsplash
Improving Zero-Shot Ranking with Vespa Hybrid Search - part two
Where should you begin if you plan to implement search functionality but have not yet collected data from user interactions to train ranking models?
Photo by Norbert Braun on Unsplash
Improving Zero-Shot Ranking with Vespa Hybrid Search
If you are planning to implement search functionality but have not yet collected data from user interactions to train ranking models, where should you begin?
Photo by Niels Weiss on Unsplash
Improving Product Search with Learning to Rank - part three
This is the third blog post on applying learning to rank to enhance E-commerce search.
Photo by Ilya Pavlov on Unsplash
Vespa Newsletter, November 2022
Vespa features and performance advances include ANN Pre-Filter Performance, Parent Field Hit-Estimates, Model Training Notebooks, and GCP Support.
Photo by Carl Campbell on Unsplash
Improving Product Search with Learning to Rank - part two
This is the second blog post on applying learning to rank to enhance E-commerce search.
Vespa Cloud on Google Cloud Platform
Vespa Cloud is now available on Google Cloud Platform
Photo by Pawel Czerwinski on Unsplash
Improving Product Search with Learning to Rank - part one
This is the first blog post on applying learning to rank to enhance E-commerce search.
Photo by Scott Graham on Unsplash
Vespa Newsletter, October 2022
Advances in Vespa features and performance include a BertBase Embedder / model hub, improved query performance, paged attributes, ARM64 support and term bolding in string arrays.
Photo by julien Tromeur on Unsplash