Case Study

    Green Vectors vs Elastic BBQ: A Head-to-Head Benchmark

    Morphos AI benchmarked Green Vectors against Elastic's Better Binary Quantization (BBQ), one of the most widely used vector compression methods, using the complete Project Gutenberg dataset. The benchmark measured three configurations: Green Vectors alone, BBQ alone, and Green Vectors combined with BBQ.

    Over 2.1x
    Higher search accuracy
    Up to 77%
    Faster search queries
    99%
    Reduction in storage footprint
    ConfigurationStorageRelevancy
    BBQ alone175 GB.4576
    Green Vectors alone1.5 GB.9658
    Green Vectors with BBQ2.6 GB.9653

    Relevancy score: closer to 1 is more accurate.

    The challenge

    A next-generation vectorization technology must be tested against established industry solutions. Elastic BBQ is a powerful compression method for large-scale vector workloads. The benchmark measured not only storage but the often-overlooked metrics of accuracy and latency, at the scale of the full Project Gutenberg corpus.

    The results

    Green Vectors alone achieved 1.5GB of storage at a .9658 relevancy score. BBQ alone required 175GB at a .4576 relevancy score. That makes Green Vectors roughly 116 times more storage-efficient and more than twice as accurate as BBQ on this benchmark.

    Layering, in context

    Green Vectors and quantization operate on different axes, so they can be layered. In this benchmark, however, Green Vectors alone was the optimal configuration. Combining Green Vectors with BBQ held relevancy at .9653 but used 2.6GB, more storage than Green Vectors alone at 1.5GB, because on an already-minimal index BBQ's rotation and correction overhead costs more than it saves. The practical takeaway is that Green Vectors delivers the efficiency quantization aims for, which makes a separate quantization step optional.

    Why this matters

    The key finding is that Green Vectors achieves its efficiency while improving accuracy, where compression alone sacrifices accuracy to save space. Green Vectors eliminates redundancy at the source rather than degrading precision.

    FAQ

    Frequently asked questions.

    On the Project Gutenberg benchmark, Green Vectors alone was roughly 116x more storage-efficient and more than twice as accurate as BBQ alone. They can also be combined.
    They can be layered, since they work on different axes, but the benchmark showed Green Vectors alone was the optimal configuration. Adding BBQ increased storage without improving relevancy. With Green Vectors, quantization becomes optional.
    No. BBQ compresses by lowering precision. Green Vectors reduces the number of vectors by eliminating redundancy, preserving precision.

    Related

    See Green Vectors on your data

    Request Kitana beta access