I try to design my app to find database entries which are similar.
Let’s for example take the table car (Everything in one table to keep the example simple):
CarID | Car Name | Brand | Year | Top Speed | Performance | Displacement | Price
1 Z3 BMW 1990 250 5.4 123 23456
2 3er BMW 2000 256 5.4 123 23000
3 Mustang Ford 2000 190 9.8 120 23000
Now i want to do Queries like that:
“Search for Cars similar to Z3 (all brands)” (ignore “Car Name”)
Similar in this context means that the row where the most columns are exactly the same is the most similar.
In this example it would be “3er BMW” since 2 columns(Performance and Displacement are the same)
Can you give me hints how to design database queries/application like that. The application gonna be really big with a lot of entries.
Also I would really appreciate useful links or books. (No problem for me to investigate further if i know where to search or what to read)
You could try to give each record a ‘score’ depending on its fields
You could weigh a column’s score depending on how important the property is for the comparison (for instance top speed could be more important than brand)
You’ll end up with a score for each record, and you will be able to find similar records by comparing scores and finding the records that are +/- 5% (for example) of the record you’re looking at