ComBI: Compressed Binary Search Tree for Approximate k-NN Searches in Hamming Space

Gupta, Prashant ; Jindal, Aashi ; Jayadeva, a ; Sengupta, Debarka (2021) ComBI: Compressed Binary Search Tree for Approximate k-NN Searches in Hamming Space Big Data Research, 25 . p. 100223. ISSN 2214-5796

Full text not available from this repository.

Official URL: https://doi.org/10.1016/j.bdr.2021.100223

Related URL: http://dx.doi.org/10.1016/j.bdr.2021.100223

Abstract

The space-partitioning based hashing techniques are widely used to represent high-dimensional data points as bit-codes. Although Binary Search Trees (BSTs) can be used for storing bit-codes, their size grows exponentially with code length. In practice, such a tree turns out to be highly sparse, increasing the miss-rate of nearest neighbor searches. We present Compressed BST of Inverted hash tables (ComBI), a geometrically motivated compression technique for BSTs. ComBI enables fast and approximate nearest neighbor searches without a significant memory footprint over BSTs. We show that approximate search in ComBI can perform competitively to an exact search algorithm in retrieving the nearest neighbors in a Hamming space. On a database containing ∼80M samples, ComBI yields an average precision of 0.90, at ∼4X - ∼296X improvements in run-time across different code lengths when compared to MIH, a widely used exact search method. On a database consisting of 1B samples, this value of precision (0.90) is reached at ∼4X - ∼19X improvements in run-time.

Item Type:Article
Source:Copyright of this article belongs to Elsevier B.V.
ID Code:142527
Deposited On:24 Jan 2026 11:31
Last Modified:24 Jan 2026 11:31

Repository Staff Only: item control page