Fetch data for 24 days to stay within quota by hugovk · Pull Request #49 · hugovk/top-pypi-packages (original) (raw)

@hugovk Does Data processed account for actually fetched values too or could more columns be fetched?

It should account for the actually fetched values, because it's the result of the query sent. You can see a snippet in #42 which does more or less the same thing.

Would it be possible to enable a GitHub cron job to generate every month automatically?

docs.github.com/en/actions/writing-workflows/choosing-when-your-workflow-runs/events-that-trigger-workflows#schedule

on: schedule:

* is a special character in YAML so you have to quote this string

Yes, but there's already a cron running on Digital Ocean (see the README for details) that's meant to fetch the data each month automatically. Unfortunately the free quota is becoming too little and I need to adjust the amount fetched.

About the 2025.06 data: that also used up too much quota and so didn't complete. I'd meant to merge #50 before the 1st June, but I was travelling. I'll have to do a manual run instead.