Today I was able to swap out the pairing logic for OSM nodes and VSI stores. I now go over each VSI store and pick the closest OSM node, with no hard distance threshold. I do use a soft cap of 1 km, just to guarantee that the cross product of the two tables won’t explode, but no VSI store is left unmatched within that radius.
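For illustration, here is a minimal sketch of that closest-node pairing in Python. It is not the actual query behind the dashboard: the dictionary layout (`lat`, `lon`, `id`), the function names, and the use of the haversine formula for distance are all assumptions made for the example. Only the 1 km soft cap comes from the description above.

```python
import math

EARTH_RADIUS_M = 6_371_000  # mean Earth radius in meters


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two lat/lon points (degrees)."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def closest_node(store, nodes, cap_m=1000.0):
    """Return (node, distance_m) for the nearest node within cap_m, or None.

    The cap only bounds how many candidate pairs are considered; among the
    candidates inside it, the nearest one always wins.
    """
    best = None
    for node in nodes:
        d = haversine_m(store["lat"], store["lon"], node["lat"], node["lon"])
        if d <= cap_m and (best is None or d < best[1]):
            best = (node, d)
    return best


# Hypothetical sample data, not taken from the real tables.
store = {"id": "vsi-1", "lat": 0.0, "lon": 0.0}
nodes = [
    {"id": "osm-far", "lat": 0.0, "lon": 0.02},
    {"id": "osm-mid", "lat": 0.0, "lon": 0.005},
    {"id": "osm-near", "lat": 0.0, "lon": 0.001},
]
match = closest_node(store, nodes)
```

In a real pipeline the loop over all nodes would be replaced by a spatial join or index, which is exactly where the cap earns its keep.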
The analysis can be seen in this Data Studio dashboard.
As of now there are 140 good-looking matches between OSM and VSI, basically half the dataset and far more than the initial 10%! I’ll go through these manually anyway to confirm, because a few of them have a noticeable location discrepancy (>50 meters) and many have naming variations.
There are 138 non-matching pairs. A few of these are simply being missed by the fuzzy name-matching logic, but most do seem to have significantly different names and locations. These will require a bit more survey work to sort out.
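To make the split between matches and non-matches concrete, here is a rough sketch of a fuzzy name comparison in Python. The real matching logic isn't shown in this post, so everything here is an assumption: the normalization step, the use of `difflib.SequenceMatcher` from the standard library, the 0.8 cutoff, and the sample names are all made up for the example.

```python
import re
from difflib import SequenceMatcher

NAME_THRESHOLD = 0.8  # assumed cutoff, not the value used in the real analysis


def normalize(name):
    """Lowercase, drop punctuation, and collapse whitespace."""
    cleaned = re.sub(r"[^a-z0-9 ]", "", name.lower())
    return " ".join(cleaned.split())


def name_similarity(a, b):
    """Similarity ratio in [0, 1] between two normalized names."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def names_match(a, b, threshold=NAME_THRESHOLD):
    """True when the two names are similar enough to count as a match."""
    return name_similarity(a, b) >= threshold


# Hypothetical names for illustration only.
same_place = names_match("Joe's Pizza", "joes  pizza")
different_place = names_match("Joe's Pizza", "Central Hardware")
```

A fixed cutoff like this will always miss some legitimate variations (abbreviations, reordered words), which is the kind of pair that ends up wrongly flagged as non-matching and needs a manual look.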