-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Exported file always empty data on Version 3.0.0 #1149
Comments
@dolsysmith Hi Dolsy, sorry to bother you. I found you upgraded the docker-compose.yml to v 3.0.0. Could you pls have a look at this issue? And could you also pls confirm if your exporter working as expected. Thank you very much! I notcied the
|
@fishfree Thanks for posting your docker logs. I am unfortunately unable to test this for myself, because I don't currently have a working Twitter v.2 API key. But I believe the issue stems from the twarc-csv package, which generates the CSV files from the Twitter JSON. Twitter has been rather relentlessly tweaking their API schema, and whenever they add or drop a field from the JSON, twarc-csv needs to be updated. I'm not sure whether the latest version of twarc-csv will handle this, but you could try the following:
With luck, when the container restarts, it will use the upgraded version of twarc-csv. If that doesn't work, you might try exporting the full JSON of the Tweets from SFM (since the full JSON export does not rely on twarc-csv) and using (the latest version of) twarc-csv outside the containers, at the command line, to convert the JSON to CSV. At the command line, you can even pass an argument to twarc-csv, as suggested by the error in the logs, which should correct for the issue: Eventually, I should have time to push a new release of SFM with the latest twarc-csv library in the Docker images. But that probably won't be for another month or so. In the meantime, I hope that helps! |
@dolsysmith Thank you very much for your tip! I rebuid the image locally with twarc-csv 0.7.2. It works now. |
@dolsysmith Sorry to bother you again. Now in a new server, I deployed sfm-docker with following docker-compose.yml:
Among these images, I built the
|
Hi @fishfree, my guess is that the Twitter data model has changed again, and that twarc-csv needs another update. Since there hasn't been another release since 0.7.2, I would recommend opening an issue on the twarc-csv repo. It might be possible to modify the SFM twitter-exporter code to check for these errors and respond accordingly; I'll keep this issue open as a reminder to look at this in a future sprint. Thanks for letting us know. |
@dolsysmith There is indeed |
I wouldn't be surprised if Twitter had not updated their official docs. I don't think the API is much of a priority for them right now. So yes, I imagine the API has changed, and that has broken the twarc-csv |
@dolsysmith Thank you! Then is there any alternative way to export harveested data as CSV files? |
@fishfree I would approach it in two steps.
|
Acturally I installed SFM UI the docker way. I install the 3.0.0 to adopt Twitter 2.0 API due to the termination of the free Twitter 1.1 API.
On the collection page, it shows results:
After exporting, it shows:
However, I open the exported file, always only has the headers, no content:
The text was updated successfully, but these errors were encountered: