Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataverse should correctly categorize certain .spx files as geospatial files #6541

Closed
jggautier opened this issue Jan 17, 2020 · 2 comments
Closed

Comments

@jggautier
Copy link
Contributor

jggautier commented Jan 17, 2020

In Dataverse 4.18.1 (and possibly older versions), Dataverse is tagging geospatial files that have the extension .spx as OGG audio files (and the file previewer tries and fails to preview what it thinks is an audio file). If you go to the dataset at https://doi.org/10.7910/DVN/KKQFKA and filter for audio files, you'll find six files that are related to ArcGIS. I'm assuming Dataverse labels them as audio files because there is an audio file format (Speex Audio Compression Format) that shares the same file extension.

Is there a way for Dataverse to tell when an .spx file is an audio file and when it's related to ArcGIS?

@mheppler
Copy link
Contributor

Strange. I looked at some resources from recent file type facets work, and found .spx files labeled as Ogg Vorbis Speex File, but no reference there to ArcGIS software. However, a keyword search on that site found 80 file extensions with "ArcGIS" in the Programs field (e.g. .ATX ArcGIS Attribute Index File; .MXT ArcGIS Map Template). All that said, this other resource in fact does label them as ArcGIS Geodatabase SPX File.

For comparison's sake, it appears we have nearly 200 OGG Audio (.ogg) files in Harvard Dataverse which were harvested from our amigos at e-cienciaDatos.

We should also consult @landreev to look into why the new JHOVE library added in 4.15 (#2202) would be missing this identification.

There are loosely documented ways to correct mislabeling like this via the native API. But a nice friendly ArcGIS label would also require the addition of UI friendly file type labels like application/spx=ArcGIS Geodatabase SPX File added to MimeTypeDisplay.properties.

application/dbf=dBASE Table for ESRI Shapefile
application/dbase=dBASE Table for ESRI Shapefile
application/prj=ESRI Shapefile
application/sbn=ESRI Spatial Index
application/sbx=ESRI Spatial Index
application/shp=Shape
application/shx=Shape
application/zipped-shapefile=Shape

And don't forget the file category facet too in MimeTypeFacets.properties

# Shape
application/dbf=Shape
application/dbase=Shape
application/prj=Shape
application/sbn=Shape
application/sbx=Shape
application/shp=Shape
application/shx=Shape
application/zipped-shapefile=Shape

@cmbz
Copy link

cmbz commented Aug 20, 2024

To focus on the most important features and bugs, we are closing issues created before 2020 (version 5.0) that are not new feature requests with the label 'Type: Feature'.

If you created this issue and you feel the team should revisit this decision, please reopen the issue and leave a comment.

@cmbz cmbz closed this as completed Aug 20, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants