Gupta, Prashant ; Jindal, Aashi ; Jayadeva, a ; Sengupta, Debarka (2021) ComBI: Compressed Binary Search Tree for Approximate k-NN Searches in Hamming Space Big Data Research, 25 . p. 100223. ISSN 2214-5796
Full text not available from this repository.
Official URL: https://doi.org/10.1016/j.bdr.2021.100223
Related URL: http://dx.doi.org/10.1016/j.bdr.2021.100223
Abstract
The space-partitioning based hashing techniques are widely used to represent high-dimensional data points as bit-codes. Although Binary Search Trees (BSTs) can be used for storing bit-codes, their size grows exponentially with code length. In practice, such a tree turns out to be highly sparse, increasing the miss-rate of nearest neighbor searches. We present Compressed BST of Inverted hash tables (ComBI), a geometrically motivated compression technique for BSTs. ComBI enables fast and approximate nearest neighbor searches without a significant memory footprint over BSTs. We show that approximate search in ComBI can perform competitively to an exact search algorithm in retrieving the nearest neighbors in a Hamming space. On a database containing ∼80M samples, ComBI yields an average precision of 0.90, at ∼4X - ∼296X improvements in run-time across different code lengths when compared to MIH, a widely used exact search method. On a database consisting of 1B samples, this value of precision (0.90) is reached at ∼4X - ∼19X improvements in run-time.
| Item Type: | Article |
|---|---|
| Source: | Copyright of this article belongs to Elsevier B.V. |
| ID Code: | 142527 |
| Deposited On: | 24 Jan 2026 11:31 |
| Last Modified: | 24 Jan 2026 11:31 |
Repository Staff Only: item control page

