Hi, I found many incorrect wikipedia/wikidata tags that link to the Spanish Wikipedia. They point to disambiguation pages instead of the actual articles. Also, let's discuss the guidelines for wiki tags (see below). Thanks!
My original post to firstname.lastname@example.org:
TLDR: researching ways to validate wikipedia and wikidata tags, wrote a
script to cross-check OSM and Wikidata, found many incorrect disambiguation
references, would love to start a community discussion on the best guidelines.
I have been analyzing the quality of OSM's wikipedia and wikidata tags by
cross-checking data using both OSM tags and Wikidata. My first goal is to
fix "disambiguation" references - cases where an OSM object links to a
Wikipedia disambiguation page instead of the real location page. I have
already fixed about 200 objects, but there are 800+ relations left, and I
could really use some help. I don't think it's possible to add them to
MapRoulette just yet.
While fixing wd/wp tagging issues, I have been putting together a list of
open questions on how we want to improve wikipedia and wikidata tags in
general, with the aim of creating some guidelines. Let's discuss them on
the talk page:
https://www.mediawiki.org/wiki/User:Yur … _questions
Lastly, if you have any suggestions on different ways to validate data
using a mixture of Wikidata and OSM, let me know. At the moment I have a
list of all the "instance of" types found on the Wikidata items that OSM
objects reference, and I mark the bad types with a value. If the
"instance of" of an OSM object's Wikidata item is one of the bad types, my
script puts that OSM object into a separate list that I can analyze.
The list of types is here - sort by the second column:
https://commons.wikimedia.org/wiki/Data … nceofs.tab
Feel free to modify the second value of any row to indicate that those
objects should be fixed.
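For anyone who wants to try the same kind of check, here is a minimal sketch of the filtering step described above. It is not the actual script: the function name, the data layout, and the sample OSM IDs and item IDs are made up for illustration. The only real identifiers are Q4167410 ("Wikimedia disambiguation page") and Q515 ("city"). It assumes the "instance of" values have already been fetched from Wikidata into a plain dictionary.

```python
# Sketch only: flag OSM objects whose Wikidata item is an instance of a "bad" type.
# Q4167410 is the Wikidata item for "Wikimedia disambiguation page".
BAD_TYPES = {"Q4167410"}


def find_suspect_objects(osm_objects, instance_of, bad_types=BAD_TYPES):
    """Return the OSM objects whose wikidata tag points at an item of a bad type.

    osm_objects: list of dicts with an "id" and an optional "wikidata" tag.
    instance_of: dict mapping a Wikidata item ID to the set of its
                 "instance of" type IDs (assumed to be fetched beforehand).
    """
    suspects = []
    for obj in osm_objects:
        qid = obj.get("wikidata")
        if qid is None:
            continue  # no wikidata tag, nothing to check
        if instance_of.get(qid, set()) & bad_types:
            suspects.append(obj)
    return suspects


# Hypothetical sample data (IDs invented for this example).
sample_objects = [
    {"id": "relation/1", "wikidata": "Q900000001"},
    {"id": "relation/2", "wikidata": "Q900000002"},
    {"id": "node/3"},
]
sample_instance_of = {
    "Q900000001": {"Q515"},      # a city: fine
    "Q900000002": {"Q4167410"},  # a disambiguation page: should be fixed
}

suspects = find_suspect_objects(sample_objects, sample_instance_of)
print([obj["id"] for obj in suspects])
```

Adding more type IDs to BAD_TYPES would play the same role as changing the second column in the shared list.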
Thank you for the information. I will try to disambiguate the tags corresponding to the Spanish Wikipedia.
I have looked at your list, and it seems that some of the pages were not ambiguous when the data was added to OSM and only became ambiguous later, when Wikipedia changed the pages. Have you seen such cases?
I'm not sure whether anything can be done in OSM to avoid it, other than maintaining the data with periodic cross-checks and also checking recently added wiki tags.