Indexing Multidimensional Data for Nearest Neighbor Queries

Updated: 16 November, 2024

Most previous work in the database literature has focused on indexing lower-dimensional data and on query types other than similarity queries. The k-d tree was one of the first structures proposed for indexing multidimensional data for nearest neighbor queries. More recently, it has been used in geographic information systems for queries that resemble similarity queries, and it might be useful for similarity indexing. Other methods, such as space-filling curves, linear quadtrees, and grid files, do not scale well to high dimensions but may be useful for medium-dimensional data.
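
To make the nearest neighbor query concrete, here is a minimal sketch of a k-d tree in Python. It is an illustration only, not the structure as specified in any particular paper: the dictionary-based node layout and the function names build_kdtree and nearest are assumptions made for this example, and the distance is assumed to be Euclidean.

```python
from math import dist  # Euclidean distance, Python 3.8+

def build_kdtree(points, depth=0):
    """Recursively build a k-d tree as nested dicts (None for an empty subtree)."""
    if not points:
        return None
    axis = depth % len(points[0])            # cycle through the dimensions
    points = sorted(points, key=lambda p: p[axis])
    mid = len(points) // 2                    # the median point becomes this node
    return {
        "point": points[mid],
        "axis": axis,
        "left": build_kdtree(points[:mid], depth + 1),
        "right": build_kdtree(points[mid + 1:], depth + 1),
    }

def nearest(node, query, best=None):
    """Return the stored point closest to query, or None for an empty tree."""
    if node is None:
        return best
    if best is None or dist(query, node["point"]) < dist(query, best):
        best = node["point"]
    diff = query[node["axis"]] - node["point"][node["axis"]]
    near, far = (node["left"], node["right"]) if diff < 0 else (node["right"], node["left"])
    best = nearest(near, query, best)
    # Search the far side only if the splitting plane is closer than the best match so far.
    if abs(diff) < dist(query, best):
        best = nearest(far, query, best)
    return best

points = [(2, 3), (5, 4), (9, 6), (4, 7), (8, 1), (7, 2)]
tree = build_kdtree(points)
print(nearest(tree, (9, 2)))  # prints (8, 1)
```

The pruning test in nearest is what saves work in low dimensions. As dimensionality grows, the splitting plane is more often closer than the best match, so fewer subtrees can be skipped.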

Historical Context of Multidimensional Indexing

The R-tree and its most successful variant, the R*-tree, have been used most often for indexing high-dimensional data in the database literature. However, because a range is stored for each dimension, the index requires more space and more search time as dimensionality grows. For this reason, higher-dimensional data is typically mapped to a lower-dimensional space before it is indexed in an R-tree.
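
The cost of storing a range on each dimension can be sketched as follows. This is not R-tree code, only an illustration of the bounding rectangle that an R-tree entry holds; the helper names bounding_rectangle and intersects and the sample vectors are assumptions made for this example.

```python
def bounding_rectangle(vectors):
    """Return the per-dimension (low, high) ranges that enclose all vectors."""
    dims = range(len(vectors[0]))
    return [(min(v[d] for v in vectors), max(v[d] for v in vectors)) for d in dims]

def intersects(rect, query_rect):
    """True if two rectangles overlap; every dimension has to be checked."""
    return all(lo <= q_hi and q_lo <= hi
               for (lo, hi), (q_lo, q_hi) in zip(rect, query_rect))

vectors = [(0.1, 0.9, 0.4), (0.3, 0.2, 0.8), (0.7, 0.5, 0.6)]
rect = bounding_rectangle(vectors)
print(rect)           # one (low, high) pair per dimension
print(2 * len(rect))  # numbers stored per entry: 2 * dimensionality
```

Each entry holds two numbers per dimension, and an overlap test touches each dimension, so both storage and comparison cost grow linearly with dimensionality.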

Introduction to TV-tree

The TV-tree is the only method in the database literature thus far that has been proposed specifically for indexing high-dimensional data. Performance comparisons show that the TV-tree can be much more efficient than the R*-tree. However, the improved performance depends on two assumptions. The first assumption is that the dimensions of the feature vectors are ordered by “importance.” The second assumption is that sets of feature vectors in the dataset will tend to match exactly on some dimensions, especially on the first few “important” dimensions.

Assumptions Affecting Performance

The first assumption is reasonable (if not desirable), since an appropriate transform may be used. The second assumption was not explicitly stated in the paper, but a careful analysis of its algorithms reveals that the performance improvement depends upon it. In some applications, the original feature vectors contain a small set of discrete quantities, so the second assumption does hold.
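
As a rough illustration of ordering dimensions by importance, the sketch below reorders the dimensions of a set of feature vectors by decreasing variance. Treating variance as the measure of importance is an assumption made for this example; the text does not name a specific transform, and the function name order_by_variance is hypothetical.

```python
from statistics import pvariance

def order_by_variance(vectors):
    """Reorder dimensions so that the highest-variance dimension comes first."""
    dims = range(len(vectors[0]))
    variances = [pvariance([v[d] for v in vectors]) for d in dims]
    order = sorted(dims, key=lambda d: variances[d], reverse=True)
    return order, [tuple(v[d] for d in order) for v in vectors]

vectors = [(1.0, 10.0, 5.0), (1.1, 20.0, 5.0), (0.9, 30.0, 6.0)]
order, reordered = order_by_variance(vectors)
print(order)         # prints [1, 2, 0]
print(reordered[0])  # prints (10.0, 5.0, 1.0)
```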

Limitations in Visual Information Systems

Unfortunately, the second assumption will normally not hold in visual information systems or in many other applications. Features in these applications are typically real-valued, so the chance of an exact match on any dimension is negligible. In this case, the TV-tree reduces to an index on only the first few dimensions. Small changes in the proposed algorithms should allow the TV-tree to be a modest improvement over the R*-tree in these applications. However, in this paper, we will refer to the R-tree (and its variants) as the best previously known structure for similarity indexing because it has proven itself in more similarity indexing applications.
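
The difference between discrete and real-valued features can be checked with a small experiment. The sketch below measures, for randomly generated vectors, how many leading dimensions two vectors share exactly. The data is synthetic and the helper names are assumptions made for this example; it does not reproduce any TV-tree algorithm.

```python
import random

def shared_prefix_length(a, b):
    """Number of leading dimensions on which two vectors match exactly."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n

def mean_shared_prefix(vectors):
    """Average shared-prefix length over all pairs of vectors."""
    pairs = [(a, b) for i, a in enumerate(vectors) for b in vectors[i + 1:]]
    return sum(shared_prefix_length(a, b) for a, b in pairs) / len(pairs)

rng = random.Random(0)  # fixed seed so the run is repeatable
discrete = [tuple(rng.randint(0, 1) for _ in range(8)) for _ in range(200)]
real = [tuple(rng.random() for _ in range(8)) for _ in range(200)]

print(mean_shared_prefix(discrete))  # binary features: pairs often agree on leading dimensions
print(mean_shared_prefix(real))      # real-valued features: exact agreement is extremely unlikely
```

With binary features, pairs of vectors regularly agree on one or more leading dimensions. With real-valued features an exact match is so unlikely that almost no pair shares even the first dimension.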

Related Works Beyond Database Literature

There is also related work outside the database literature. In the information retrieval literature, work on cluster files has proposed structures similar to the SS-tree. In the image database community, a static indexing structure based on Kohonen nets has been suggested. There is also related work in the computational geometry and vector quantization literature.

Cite this Essay

Indexing Multidimensional Data for Nearest Neighbor Queries. (2019, April 10). GradesFixer. Retrieved July 5, 2025, from https://gradesfixer.com/free-essay-examples/indexing-multidimensional-data-for-nearest-neighbor-queries/
