Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deals must be manually restarted due to data transfer failures #4731

Closed
arajasek opened this issue Nov 5, 2020 · 8 comments
Closed

Deals must be manually restarted due to data transfer failures #4731

arajasek opened this issue Nov 5, 2020 · 8 comments
Labels
area/data-transfer area/markets/storage effort/hours Effort: Hours P2 P2: Should be resolved team/ignite Issues and PRs being tracked by Team Ignite at Protocol Labs

Comments

@arajasek
Copy link
Contributor

arajasek commented Nov 5, 2020

Edit: The original issue was about protecting connections lest we close them when cleaning up and interrupt a transfer. They should be unprotected when the transfer completes.
Edit: It seems that connections are already protected so the data transfer failures are being caused by a different issue. -@dirkmc

@arajasek arajasek added P2 P2: Should be resolved effort/minutes labels Nov 5, 2020
@arajasek
Copy link
Contributor Author

arajasek commented Nov 5, 2020

Based on some discussion with @Stebalien and @hannahhoward, this is already happening, yet @whyrusleeping reports that the majority of deals he tried needed data transfer to be restarted because the connection was closed from the remote (miner) side.

@Stebalien
Copy link
Member

Stebalien commented Nov 5, 2020 via email

@jennijuju jennijuju added the area/markets Area: Markets label Nov 11, 2020
@jennijuju jennijuju added this to the 💹Storage Deal Success milestone Nov 11, 2020
@dirkmc
Copy link
Contributor

dirkmc commented Nov 12, 2020

I spent some time today digging into this issue with @aarshkshah1992. It seems that:

In order to resolve the issue I think we will need more information about why exactly the connection is being closed.

@dirkmc dirkmc changed the title Mark data transfer peers in libp2p as protected Deals must be manually restarted due to data transfer failures Nov 12, 2020
@Stebalien
Copy link
Member

We believe the issue was yamux hitting a keepalive timeout. I believe this situation has gotten better since we fixed that bug.

@hannahhoward
Copy link
Contributor

also: side note: the timeouts on reconnect in graphsync may not be long enough to overcome libp2p's dialing backoff and we may need to increase them.

@hannahhoward
Copy link
Contributor

also I just realized the note about the disconnection is not resolved when we reconnect -- even if transfer resumes.

@dineshshenoy dineshshenoy added effort/hours Effort: Hours and removed effort/minutes labels Nov 16, 2020
@dineshshenoy dineshshenoy removed this from the 💹Storage Deal Success milestone Nov 25, 2020
@raulk raulk added the team/ignite Issues and PRs being tracked by Team Ignite at Protocol Labs label Mar 24, 2021
@dirkmc dirkmc mentioned this issue May 6, 2021
3 tasks
@dkkapur
Copy link
Contributor

dkkapur commented May 24, 2021

@dirkmc - fair to assume this can be closed with the fixes in markets v1.3?

@dkkapur dkkapur added this to the 🤝 Deal Success milestone May 24, 2021
@dirkmc
Copy link
Contributor

dirkmc commented May 31, 2021

Yes I'm closing, thanks Deep

@dirkmc dirkmc closed this as completed May 31, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/data-transfer area/markets/storage effort/hours Effort: Hours P2 P2: Should be resolved team/ignite Issues and PRs being tracked by Team Ignite at Protocol Labs
Projects
None yet
Development

No branches or pull requests

8 participants