Posts: 1,414 | Thanked: 7,547 times | Joined on Aug 2016 @ Estonia
#734
Originally Posted by BGK
As I'm always trying to learn: Could you explain
  • what transliteration.dat is for and how could it be corrupted?
  • what the other issues are about?
Sure. transliteration.dat is one of the datasets used by libpostal to parse addresses. For an illustration of what it does, go to https://github.com/openvenues/libpos...les-of-parsing and scroll down to the normalization section as well.

Fun fact: OSMSS was, to my knowledge, the first on mobile to use this library.

We use libpostal as the core of the geocoder, the library that finds coordinates based on your search string. Of the datasets used by libpostal, the transliteration one is, as far as I can guess, used to turn Привет into privet. As a result, you can search for addresses even if you don't have the same characters on your keyboard.
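
To make the idea concrete, here is a toy sketch in Python. It is not how libpostal does it and it does not read transliteration.dat. The table CYR_TO_LAT and the helpers transliterate and matches are made up for this post, and the table covers only a handful of Cyrillic letters.

```python
# Toy illustration only: libpostal uses its own rules and data files.
# CYR_TO_LAT is a hypothetical, incomplete table made up for this example.
CYR_TO_LAT = {
    "а": "a", "б": "b", "в": "v", "г": "g", "д": "d", "е": "e",
    "и": "i", "к": "k", "л": "l", "м": "m", "н": "n", "о": "o",
    "п": "p", "р": "r", "с": "s", "т": "t", "у": "u",
}


def transliterate(text: str) -> str:
    """Lowercase the text and map each known Cyrillic letter to Latin.

    Characters missing from the table are passed through unchanged.
    """
    return "".join(CYR_TO_LAT.get(ch, ch) for ch in text.lower())


def matches(query: str, name: str) -> bool:
    """Compare a typed query with a stored name after transliteration."""
    return transliterate(query) == transliterate(name)


if __name__ == "__main__":
    print(transliterate("Привет"))
    print(matches("privet", "Привет"))
```

The point is that both the query and the stored name are brought to the same form before comparing, so a Latin keyboard is enough to find a name written in Cyrillic.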


Issues are many:

https://github.com/rinigus/osmscout-server/issues
https://github.com/rinigus/geocoder-nlp/issues
https://github.com/rinigus/mapbox-gl-styles/issues

to name a few places.
 

The Following 3 Users Say Thank You to rinigus For This Useful Post: