close
test_template

Indexing Multidimensional Data for Nearest Neighbor Queries

download print

About this sample

About this sample

close

Words: 449 |

Page: 1|

3 min read

Published: Apr 11, 2019

Words: 449|Page: 1|3 min read

Published: Apr 11, 2019

Most previous works in the database literature has focused on indexing lower dimensional data and on other types of queries besides similarity queries. The lc-d tree was one of the first structures proposed for indexing multidimensional data for nearest neighbor queries. Recently, this structure has been used in geographic information systems for queries like similarity queries, and might be useful for similarity indexing. Other methods such as space filling curves, linear quad trees, and grid files, do not scale well to high dimensions, but may be useful for medium dimensional data.

'Why Violent Video Games Shouldn't Be Banned'?

The R-tree and its most successful variant, the R*-tree, have been used most often for indexing high dimensional data in the database literature. However, since ranges are stored on each dimension, the index requires more space and time to search in higher dimensionality. For this reason, higher dimensional data typically is mapped to a lower dimensional space before indexing in R-trees.

The TV-tree is the only method in the database literature thus far that has been proposed specifically for indexing high-dimensional data. Performance comparisons clearly show that the TV-tree can be much more efficient than the R*-tree. However, the improved performance depends on two assumptions. The first assumption is that dimensions and the feature vectors are ordered by “importance.” This second assumption is that sets of feature vectors in the dataset will tend to exactly match on dimensions, especially on the first few “important” dimensions.

The first assumption is reasonable (if not desirable) since an appropriate transform may be used. The second assumption was not explicitly stated, Ln the paper, but a careful analysis of their algorithms reveals that their performance improvement depends upon it. In some applications, the original feature vectors contain a small set of discrete quantities, so the second assumption does hold.

Unfortunately, this second assumption will normally not be true in visual information systems, and in many other applications. Features in these applications are typically real-valued, so that chances of exactly matching on dimensions are negligible. In this case, the TV-tree reduces to an index on only first few dimensions. Small changes in the proposed algorithms should allow the TV-tree to be a modest improvement over the R*-tree in these applications. However, in this paper, we will refer to the R-tree (and variants) as the best previously known structure for similarity indexing because it has proven itself in more similarity indexing applications.

Get a custom paper now from our expert writers.

There is also related work outside of the database literature. In the information retrieval literature, work has been done on cluster fides that proposes structures similar to the SS-tree. In the image database community, a static indexing structure based on Kohonen nets was suggested. There is also related work in the computational geometry and vector quantization literature.

Image of Alex Wood
This essay was reviewed by
Alex Wood

Cite this Essay

Indexing Multidimensional Data for Nearest Neighbor Queries. (2019, April 10). GradesFixer. Retrieved April 26, 2024, from https://gradesfixer.com/free-essay-examples/indexing-multidimensional-data-for-nearest-neighbor-queries/
“Indexing Multidimensional Data for Nearest Neighbor Queries.” GradesFixer, 10 Apr. 2019, gradesfixer.com/free-essay-examples/indexing-multidimensional-data-for-nearest-neighbor-queries/
Indexing Multidimensional Data for Nearest Neighbor Queries. [online]. Available at: <https://gradesfixer.com/free-essay-examples/indexing-multidimensional-data-for-nearest-neighbor-queries/> [Accessed 26 Apr. 2024].
Indexing Multidimensional Data for Nearest Neighbor Queries [Internet]. GradesFixer. 2019 Apr 10 [cited 2024 Apr 26]. Available from: https://gradesfixer.com/free-essay-examples/indexing-multidimensional-data-for-nearest-neighbor-queries/
copy
Keep in mind: This sample was shared by another student.
  • 450+ experts on 30 subjects ready to help
  • Custom essay delivered in as few as 3 hours
Write my essay

Still can’t find what you need?

Browse our vast selection of original essay samples, each expertly formatted and styled

close

Where do you want us to send this sample?

    By clicking “Continue”, you agree to our terms of service and privacy policy.

    close

    Be careful. This essay is not unique

    This essay was donated by a student and is likely to have been used and submitted before

    Download this Sample

    Free samples may contain mistakes and not unique parts

    close

    Sorry, we could not paraphrase this essay. Our professional writers can rewrite it and get you a unique paper.

    close

    Thanks!

    Please check your inbox.

    We can write you a custom essay that will follow your exact instructions and meet the deadlines. Let's fix your grades together!

    clock-banner-side

    Get Your
    Personalized Essay in 3 Hours or Less!

    exit-popup-close
    We can help you get a better grade and deliver your task on time!
    • Instructions Followed To The Letter
    • Deadlines Met At Every Stage
    • Unique And Plagiarism Free
    Order your paper now