Main Page: Difference between revisions

From geokb
}
}
LIMIT 100
</sparql>
=== Duplicate Pubs Warehouse IDs ===
The USGS Publications Warehouse is our primary source for items representing scientific publications (USGS reports and journal articles). We tapped this source via a web service to build a baseline of all USGS Numbered Series reports and all journal articles that had contributors identified with an ORCID (meaning we could reasonably establish those entities in the GeoKB and build in linkages). In doing this work, we discovered a number of cases where different USGS Pubs Warehouse records document the same publication. In these cases, we recorded multiple indexId values on the corresponding GeoKB item. The following query pulls out those cases for examination.
<sparql tryit="1">
PREFIX wd: <https://geokb.wikibase.cloud/entity/>
PREFIX wdt: <https://geokb.wikibase.cloud/prop/direct/>
SELECT ?item ?itemLabel
(GROUP_CONCAT(DISTINCT ?indexId; SEPARATOR=", ") AS ?indexIds)
(GROUP_CONCAT(DISTINCT ?doi; SEPARATOR=", ") AS ?dois)
WHERE {
  ?item wdt:P114 ?indexId .
  OPTIONAL {
    ?item wdt:P74 ?doi .
  }
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en" . }
}
GROUP BY ?item ?itemLabel
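# Count distinct values: the OPTIONAL DOI join repeats each indexId once per DOI,
# so a plain COUNT would also flag items with one indexId and several DOIs.
HAVING (COUNT(DISTINCT ?indexId) > 1)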
</sparql>
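When examining individual cases, it can help to see each indexId on its own row rather than concatenated. The following is a minimal sketch of one way to do that, assuming (as in the query above) that P114 holds the Pubs Warehouse indexId. The inner subquery selects only items with more than one distinct indexId, and the outer pattern then lists each value separately. The variable name ?id is only illustrative.
<sparql tryit="1">
PREFIX wdt: <https://geokb.wikibase.cloud/prop/direct/>
SELECT ?item ?indexId
WHERE {
  {
    # Inner query: items carrying more than one distinct indexId
    SELECT ?item
    WHERE { ?item wdt:P114 ?id . }
    GROUP BY ?item
    HAVING (COUNT(DISTINCT ?id) > 1)
  }
  # Outer pattern: one row per indexId for each of those items
  ?item wdt:P114 ?indexId .
}
ORDER BY ?item ?indexId
</sparql>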