Ingest gpt-fast benchmark results from S3 to Rockset #125891

huydhn · 2024-05-10T02:05:13Z

A follow-up of #125450, this extends the tools/stats/upload_dynamo_perf_stats.py script to upload arbitrary benchmark results in CSV format.

Upload gpt-fast benchmarks to a new Rockset collection benchmarks/oss_ci_benchmark. The file is in the following format:

$ cat test/test-reports/gpt_fast_benchmark.csv
name,mode,target,actual,percentage
Llama-2-7b-chat-hf,bfloat16,104,104.754128,100.73%

The CSV output needs to be kept in test/test-reports directory.
Re-use the existing .github/workflows/upload-test-stats.yml workflow

Testing

Run the commands manually

(py3.11) huydo@huydo-mbp pytorch % python3 -m tools.stats.upload_artifacts --workflow-run-id 9026179545 --workflow-run-attempt 1 --repo "pytorch/pytorch"
Using temporary directory: /var/folders/x4/2kd9r0fn5b9bf_sbcw16fxsc0000gn/T/tmp6eug3cdz
Downloading test-jsons-runattempt1-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip
Upload /private/var/folders/x4/2kd9r0fn5b9bf_sbcw16fxsc0000gn/T/tmp6eug3cdz/test-jsons-runattempt1-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip to s3://gha-artifacts/pytorch/pytorch/9026179545/1/artifact/test-jsons-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip
Downloading test-reports-runattempt1-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip
Upload /private/var/folders/x4/2kd9r0fn5b9bf_sbcw16fxsc0000gn/T/tmp6eug3cdz/test-reports-runattempt1-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip to s3://gha-artifacts/pytorch/pytorch/9026179545/1/artifact/test-reports-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip

(py3.11) huydo@huydo-mbp pytorch % python3 -m tools.stats.upload_dynamo_perf_stats --workflow-run-id 9026179545 --workflow-run-attempt 1 --repo "pytorch/pytorch" --head-branch "ciflow/inductor-micro-benchmark/125891" --rockset-collection oss_ci_benchmark --rockset-workspace benchmarks --match-filename "^gpt_fast_benchmark"
Using temporary directory: /var/folders/x4/2kd9r0fn5b9bf_sbcw16fxsc0000gn/T/tmp8xr4sdxk
Downloading test-reports-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip
Extracting test-reports-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip to unzipped-test-reports-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212
Processing gpt_fast_benchmark from test-reports-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip
Writing 3 documents to Rockset
Done!

Also run a sanity check on ingesting inductor benchmark results:

(py3.11) huydo@huydo-mbp pytorch % python -m tools.stats.upload_dynamo_perf_stats --workflow-run-id 8997654356 --workflow-run-attempt 1 --repo pytorch/pytorch --head-branch main --rockset-collection torch_dynamo_perf_stats --rockset-workspace inductor --match-filename "^inductor_"
...
Writing 4904 documents to Rockset
Done!

pytorch-bot · 2024-05-10T02:05:17Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/125891

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 6bd0f3e with merge base 6c4f43f ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

huydhn · 2024-05-10T17:01:47Z

@pytorchbot drci

yanboliang

LGTM

huydhn · 2024-05-11T04:08:17Z

@pytorchbot merge

pytorchmergebot · 2024-05-11T04:11:11Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

huydhn · 2024-05-11T04:13:43Z

@pytorchbot merge -f 'CI only, no need to run trunk job'

pytorchmergebot · 2024-05-11T04:14:01Z

The merge job was canceled or timed out. This most often happen if two merge requests were issued for the same PR, or if merge job was waiting for more than 6 hours for tests to finish. In later case, please do not hesitate to reissue the merge command
For more information see pytorch-bot wiki.

pytorchmergebot · 2024-05-11T04:16:22Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

A follow-up of pytorch#125450, this extends the `tools/stats/upload_dynamo_perf_stats.py` script to upload arbitrary benchmark results in CSV format. * Upload gpt-fast benchmarks to a new Rockset collection `benchmarks/oss_ci_benchmark`. The file is in the following format: ``` $ cat test/test-reports/gpt_fast_benchmark.csv name,mode,target,actual,percentage Llama-2-7b-chat-hf,bfloat16,104,104.754128,100.73% ``` * The CSV output needs to be kept in `test/test-reports` directory. * Re-use the existing `.github/workflows/upload-test-stats.yml` workflow ### Testing Run the commands manually ``` (py3.11) huydo@huydo-mbp pytorch % python3 -m tools.stats.upload_artifacts --workflow-run-id 9026179545 --workflow-run-attempt 1 --repo "pytorch/pytorch" Using temporary directory: /var/folders/x4/2kd9r0fn5b9bf_sbcw16fxsc0000gn/T/tmp6eug3cdz Downloading test-jsons-runattempt1-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip Upload /private/var/folders/x4/2kd9r0fn5b9bf_sbcw16fxsc0000gn/T/tmp6eug3cdz/test-jsons-runattempt1-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip to s3://gha-artifacts/pytorch/pytorch/9026179545/1/artifact/test-jsons-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip Downloading test-reports-runattempt1-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip Upload /private/var/folders/x4/2kd9r0fn5b9bf_sbcw16fxsc0000gn/T/tmp6eug3cdz/test-reports-runattempt1-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip to s3://gha-artifacts/pytorch/pytorch/9026179545/1/artifact/test-reports-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip (py3.11) huydo@huydo-mbp pytorch % python3 -m tools.stats.upload_dynamo_perf_stats --workflow-run-id 9026179545 --workflow-run-attempt 1 --repo "pytorch/pytorch" --head-branch "ciflow/inductor-micro-benchmark/125891" --rockset-collection oss_ci_benchmark --rockset-workspace benchmarks --match-filename "^gpt_fast_benchmark" Using temporary directory: /var/folders/x4/2kd9r0fn5b9bf_sbcw16fxsc0000gn/T/tmp8xr4sdxk Downloading test-reports-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip Extracting test-reports-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip to unzipped-test-reports-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212 Processing gpt_fast_benchmark from test-reports-test-inductor-micro-benchmark-1-1-linux.gcp.a100_24803987212.zip Writing 3 documents to Rockset Done! ``` Also run a sanity check on ingesting inductor benchmark results: ``` (py3.11) huydo@huydo-mbp pytorch % python -m tools.stats.upload_dynamo_perf_stats --workflow-run-id 8997654356 --workflow-run-attempt 1 --repo pytorch/pytorch --head-branch main --rockset-collection torch_dynamo_perf_stats --rockset-workspace inductor --match-filename "^inductor_" ... Writing 4904 documents to Rockset Done! ``` Pull Request resolved: pytorch#125891 Approved by: https://github.com/yanboliang

Ingest gpt-fast benchmark results from S3 to Rockset

434c083

huydhn requested review from yanboliang and clee2000 May 10, 2024 02:05

huydhn requested a review from a team as a code owner May 10, 2024 02:05

pytorch-bot bot added ci-td-distributed release notes: releng release notes category labels May 10, 2024

huydhn added ciflow/inductor-micro-benchmark suppress-bc-linter Suppresses the failures of API backward-compatibility linter (Lint/bc_linter) labels May 10, 2024

huydhn requested a review from ZainRizvi May 10, 2024 17:02

yanboliang approved these changes May 10, 2024

View reviewed changes

Tweak the test report filename regex to accept dash

6bd0f3e

huydhn added test-config/default test-config/inductor-micro-benchmark labels May 10, 2024

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label May 11, 2024

pytorchmergebot added the merging label May 11, 2024

pytorchmergebot added the Merged label May 11, 2024

pytorchmergebot closed this in 9dee3ef May 11, 2024

pytorchmergebot removed the merging label May 11, 2024

huydhn mentioned this pull request May 21, 2024

Create a perf benchmark dashboard for gpt-fast pytorch/test-infra#5225

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ingest gpt-fast benchmark results from S3 to Rockset #125891

Ingest gpt-fast benchmark results from S3 to Rockset #125891

huydhn commented May 10, 2024 •

edited

pytorch-bot bot commented May 10, 2024 •

edited

huydhn commented May 10, 2024

yanboliang left a comment

huydhn commented May 11, 2024

pytorchmergebot commented May 11, 2024

huydhn commented May 11, 2024

pytorchmergebot commented May 11, 2024

pytorchmergebot commented May 11, 2024

Ingest gpt-fast benchmark results from S3 to Rockset #125891

Ingest gpt-fast benchmark results from S3 to Rockset #125891

Conversation

huydhn commented May 10, 2024 • edited

Testing

pytorch-bot bot commented May 10, 2024 • edited

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/125891

✅ No Failures

huydhn commented May 10, 2024

yanboliang left a comment

Choose a reason for hiding this comment

huydhn commented May 11, 2024

pytorchmergebot commented May 11, 2024

Merge started

huydhn commented May 11, 2024

pytorchmergebot commented May 11, 2024

pytorchmergebot commented May 11, 2024

Merge started

huydhn commented May 10, 2024 •

edited

pytorch-bot bot commented May 10, 2024 •

edited