question

Upvotes
Accepted
1 0 1 2

Incomplete Entity Bulk Download Files

I downloaded the OpenPermID entity files from 10/20/2019. It appears that entities are missing, for example cannot find https://permid.org/1-4295907555 in the download file for organizations.

I downloaded again today the entity files from 10/27/2019. However, they seem more incomplete than the previous ones.

Org file uncompressed = 2GB on 10/20, 1.1GB on 10/27
Person file uncompressed = 7.2GB on 10/20, 6GB on 10/27

It appears from past messages that this is not the first time that this issue occurred, and that it reappears on a regular basis.

1) Could you please fix?
2) In the meantime, could it be possible to link not just to the latest file, but also to the latest COMPLETE file?
3) Can you please indicate how many entities there should be in each download file, so that we can check whether it's complete or not?

Thanks for your help, PermID is a great resource.

permid-apiintelligent-tagging-apiopen-permid-apiDownloadexport
icon clock
10 |1500

Up to 2 attachments (including images) can be used with a maximum of 512.0 KiB each and 1.0 MiB total.

Upvotes
Accepted
53.1k 138 44 63

@phil94

I found https://permid.org/1-4295907555 in the latest OpenPermID-bulk-organization-20191103_084707.ntriples file (03 Nov, 2019).


-bash-3.00$ grep 1-4295907555 OpenPermID-bulk-organization-20191103_084707.ntriples
<https://permid.org/1-4295907555> <http://permid.org/ontology/common/hasPermId> "4295907555"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://permid.org/1-4295907555> <http://ont.thomsonreuters.com/mdaas/RegisteredAddress> "Corporation Trust Center\n1209 Orange Street\nNew Castle County\nWILMINGTON\nDELAWARE\n19801\nUnited States\n"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://permid.org/1-4295907555> <http://permid.org/ontology/organization/hasPrimaryBusinessSector> <https://permid.org/1-4294952999> .
<https://permid.org/1-4295907555> <http://ont.thomsonreuters.com/mdaas/HeadquartersAddress> "5320 Legacy Dr\n\n\nPLANO\nTEXAS\n75024-3127\n"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://permid.org/1-4295907555> <http://www.omg.org/spec/EDMC-FIBO/BE/LegalEntities/CorporateBodies/isDomiciledIn> <http://sws.geonames.org/6252001/> .
<https://permid.org/1-4295907555> <http://permid.org/ontology/organization/hasIPODate> "1997-05-08T04:00:00Z"^^<http://www.w3.org/2001/XMLSchema#dateTime> .
<https://permid.org/1-4295907555> <http://permid.org/ontology/organization/hasLEI> "549300KCWA5W52MS5559"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://permid.org/1-4295907555> <http://www.w3.org/2006/vcard/ns#organization-name> "Denbury Resources Inc"^^<http://www.w3.org/2001/XMLSchema#string> .

Could you please recheck with the latest files and let me know the result?


icon clock
10 |1500

Up to 2 attachments (including images) can be used with a maximum of 512.0 KiB each and 1.0 MiB total.

Upvotes
1 0 1 2

Hi - this record was actually missing from the 20191020 TTL file. I did not check the n-triple files.

I downloaded today the TTL files from 20191110. I ran a couple of checks on the organization file and the person file, everything seems fine. Thanks for looking into the issue.

Before marking this as solved, I would like however to check the total number of unique PermIDs for persons/organizations in the TTL files and confirm that the files contain the expected number.

icon clock
10 |1500

Up to 2 attachments (including images) can be used with a maximum of 512.0 KiB each and 1.0 MiB total.