Add uv parser to linehaul #2568

Closed
MichaReiser opened this issue Mar 20, 2024 · 5 comments · Fixed by pypi/linehaul-cloud-function#162

Comments

@MichaReiser
Member

#2493 updated uv to include additional metadata in the requests it sends to PyPI (or any other package index). I tried querying the PyPI BigQuery table, but no uv records are showing up yet. I believe this is because linehaul doesn't recognize uv: it has no uv parser configuration in

https://github.com/pypi/linehaul-cloud-function/blob/main/linehaul/ua/parser.py

We should contribute a uv parser upstream (copy pip's configuration?)
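For readers unfamiliar with what such a parser does, here is a minimal standalone sketch. It is not linehaul's actual code or API. It assumes the uv user agent has the form `uv/<version>` followed by a JSON payload (similar in spirit to pip's), and the names `parse_uv_user_agent` and `UnableToParse`, as well as the sample payload, are hypothetical.

```python
import json


class UnableToParse(ValueError):
    """Raised when a string is not a recognizable uv user agent (hypothetical name)."""


def parse_uv_user_agent(user_agent: str) -> dict:
    """Parse a user agent assumed to look like 'uv/<version> <json payload>'."""
    # Only handle user agents that identify themselves as uv.
    if not user_agent.startswith("uv/"):
        raise UnableToParse(user_agent)
    try:
        # Split off the "uv/<version>" prefix; the rest is assumed to be JSON.
        _prefix, payload = user_agent.split(maxsplit=1)
        data = json.loads(payload)
    except ValueError:
        # Covers both a missing payload (unpacking error) and invalid JSON.
        raise UnableToParse(user_agent) from None
    if not isinstance(data, dict):
        raise UnableToParse(user_agent)
    return data


if __name__ == "__main__":
    # Hypothetical payload, for illustration only.
    sample = 'uv/0.1.22 {"installer": {"name": "uv", "version": "0.1.22"}}'
    print(parse_uv_user_agent(sample)["installer"]["name"])
```

Anything that does not start with `uv/` is rejected so that other parsers (pip's, for instance) get a chance to handle it.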

@samypr100
Collaborator

I was planning on adding one over the next few weeks as time permits, unless someone wants to give it a shot.

@konstin konstin added the enhancement New feature or improvement to existing functionality label Mar 20, 2024
@charliermarsh
Member

Nice, thanks for putting that PR up @samypr100.

@samypr100
Collaborator

Seems like stats are coming in @MichaReiser @charliermarsh

[Image: uv_stats]

@MichaReiser
Member Author

Yes. Thanks for implementing it. It works like a charm.

Oh, that's an interesting query! So far I've only tracked downloads by uv version (our users are really up to date) and downloads overall.

@samypr100
Collaborator

Yea, I already ate my free tier 😆

Here's one for the top 10 downloaded packages using uv in the last day. Nice to see that pandas made the top 10 when using uv.

#standardSQL
SELECT
  file.project AS project_name,
  COUNT(*) AS num_downloads
FROM
  `bigquery-public-data.pypi.file_downloads`
WHERE
  DATE(timestamp) BETWEEN DATE_SUB(CURRENT_DATE(), INTERVAL 1 DAY) AND CURRENT_DATE()
  AND details.installer.name = 'uv'
GROUP BY
  project_name
ORDER BY
  num_downloads DESC
LIMIT 10

uv

| project_name | num_downloads |
| --- | --- |
| boto3 | 1130187 |
| botocore | 516449 |
| packaging | 320548 |
| setuptools | 303500 |
| typing-extensions | 295516 |
| pandas | 289491 |
| urllib3 | 274027 |
| requests | 268572 |
| pytest | 267713 |
| click | 255924 |

pip

| project_name | num_downloads |
| --- | --- |
| boto3 | 51247219 |
| botocore | 22762858 |
| urllib3 | 19826296 |
| requests | 17587337 |
| wheel | 16842258 |
| idna | 15655970 |
| certifi | 15591192 |
| setuptools | 15371830 |
| charset-normalizer | 15286527 |
| typing-extensions | 14930478 |
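To run the same comparison for other installers, the query can be parameterized. This is only a sketch: it assumes the pip numbers came from the same query with `details.installer.name` changed, which the thread does not state, and the helper name `top_packages_query` is hypothetical. It only builds the SQL string; running it against BigQuery is left to whatever client you use.

```python
def top_packages_query(installer: str, days: int = 1, limit: int = 10) -> str:
    """Build the top-N downloads query for one installer (hypothetical helper)."""
    # Accept only simple installer names so the value can be inlined safely.
    if not installer.replace("-", "").replace("_", "").isalnum():
        raise ValueError(f"unexpected installer name: {installer!r}")
    # int() guards against non-numeric values ending up in the SQL text.
    days = int(days)
    limit = int(limit)
    return f"""#standardSQL
SELECT
  file.project AS project_name,
  COUNT(*) AS num_downloads
FROM
  `bigquery-public-data.pypi.file_downloads`
WHERE
  DATE(timestamp) BETWEEN DATE_SUB(CURRENT_DATE(), INTERVAL {days} DAY) AND CURRENT_DATE()
  AND details.installer.name = '{installer}'
GROUP BY
  project_name
ORDER BY
  num_downloads DESC
LIMIT {limit}"""


if __name__ == "__main__":
    # Print the pip variant of the query shown above.
    print(top_packages_query("pip"))
```

Keep in mind that each run scans the public downloads table, so it counts against your BigQuery quota.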
