Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Spike] Scale OSSF Scorecards prescriptions out of GitHub for aggregation by revision #31968

Open
mayaCostantini opened this issue Oct 27, 2022 · 2 comments
Labels
kind/feature Categorizes issue or PR as related to a new feature. priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. sig/stack-guidance Categorizes an issue or PR as relevant to SIG Stack Guidance.

Comments

@mayaCostantini
Copy link
Contributor

Is your feature request related to a problem? Please describe.
As we will start aggregating Scorecards prescriptions by project repository revision as present in the new scorecards-v2 BigQuery dataset and possibly create those prescriptions for packages from other ecosystems, we should think about a more scalable solution to have this data available.
The current size of the prescriptions dataset is currently of ~500M, which will largely exceed the recommended GitHub limit of 5GiB for a repository and cause storage and performance issues.

Describe the solution you'd like
Set up a new database (possibly non-relational) or make new Scorecards prescriptions available in a S3 bucket accessed through a webservice.

Additional context
Related to thoth-station/core#440

@mayaCostantini mayaCostantini added the kind/feature Categorizes issue or PR as related to a new feature. label Oct 27, 2022
@mayaCostantini
Copy link
Contributor Author

/sig stack-guidance
/priority important-soon

@sesheta sesheta added sig/stack-guidance Categorizes an issue or PR as relevant to SIG Stack Guidance. priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. labels Oct 27, 2022
@mayaCostantini
Copy link
Contributor Author

@mayaCostantini mayaCostantini removed their assignment Dec 1, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/feature Categorizes issue or PR as related to a new feature. priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. sig/stack-guidance Categorizes an issue or PR as relevant to SIG Stack Guidance.
Projects
Status: 🆕 New
Development

No branches or pull requests

2 participants