Actions: openai/evals

Showing runs from all workflows
2,179 workflow runs

Add support for gpt-4o
Run unit tests #1753: Pull request #1530 synchronize by androettop
May 30, 2024 20:20 Action required androettop:main
Fix problematic sample in Schelling Point
Run unit tests #1752: Pull request #1534 opened by JunShern
May 22, 2024 23:04 8m 5s jun/schellingpoint-fix
Fix problematic sample in Schelling Point
Run new evals #2270: Pull request #1534 opened by JunShern
May 22, 2024 23:04 4m 38s jun/schellingpoint-fix
Update README: Add Langtrace as an Eval vendor
Run unit tests #1751: Pull request #1531 opened by karthikscale3
May 21, 2024 04:11 Action required karthikscale3:add-langtrace-to-readme
[eval] Add IMO problems with exact answers
Run unit tests #1750: Pull request #1528 reopened by justinlinw
May 17, 2024 20:43 Action required justinlinw:justinlinw/imo_solutions_only
[eval] Add IMO problems with exact answers
Run new evals #2269: Pull request #1528 reopened by justinlinw
May 17, 2024 20:43 Action required justinlinw:justinlinw/imo_solutions_only
Add support for gpt-4o
Run unit tests #1749: Pull request #1530 synchronize by androettop
May 17, 2024 20:35 Action required androettop:main
Add support for gpt-4o
Run unit tests #1744: Pull request #1530 synchronize by androettop
May 17, 2024 12:50 Action required androettop:main
Add support for gpt-4o
Run unit tests #1743: Pull request #1530 synchronize by androettop
May 17, 2024 12:21 Action required androettop:main
Support GPT-4o, Added Quran Eval & Simple Fact Model-Graded Definition
Run new evals #2264: Pull request #1511 synchronize by sakher
May 17, 2024 08:35 Action required sakher:quran-eval
Support GPT-4o, Added Quran Eval & Simple Fact Model-Graded Definition
Run unit tests #1742: Pull request #1511 synchronize by sakher
May 17, 2024 08:35 Action required sakher:quran-eval
Support GPT-4o, Added Quran Eval & Simple Fact Model-Graded Definition
Run new evals #2263: Pull request #1511 synchronize by sakher
May 17, 2024 04:08 Action required sakher:quran-eval
Support GPT-4o, Added Quran Eval & Simple Fact Model-Graded Definition
Run unit tests #1741: Pull request #1511 synchronize by sakher
May 17, 2024 04:08 Action required sakher:quran-eval
Support GPT-4o, Added Quran Eval & Simple Fact Model-Graded Definition
Run new evals #2262: Pull request #1511 synchronize by sakher
May 17, 2024 03:30 Action required sakher:quran-eval
Support GPT-4o, Added Quran Eval & Simple Fact Model-Graded Definition
Run unit tests #1740: Pull request #1511 synchronize by sakher
May 17, 2024 03:30 Action required sakher:quran-eval
Support GPT-4o, Added Quran Eval & Simple Fact Model-Graded Definition
Run new evals #2261: Pull request #1511 synchronize by sakher
May 17, 2024 03:27 Action required sakher:quran-eval
Add support for gpt-4o
Run unit tests #1738: Pull request #1530 opened by androettop
May 16, 2024 20:57 Action required androettop:main
eval pattern-concat-logic
Run unit tests #1735: Pull request #1508 synchronize by natanaelwf
May 9, 2024 13:18 3m 55s natanaelwf:pattern-concat-logic
eval pattern-concat-logic
Run new evals #2258: Pull request #1508 synchronize by natanaelwf
May 9, 2024 13:18 2m 25s natanaelwf:pattern-concat-logic
Release 3.0.1 (#1525)
Run unit tests #1733: Commit d3dc890 pushed by etr2460
May 1, 2024 00:50 4m 10s main
Release 3.0.1
Run unit tests #1732: Pull request #1525 opened by etr2460
May 1, 2024 00:24 3m 59s release/3.0.1
Make the torch dep optional (#1524)
Run unit tests #1731: Commit 1d3f11c pushed by etr2460
May 1, 2024 00:14 10m 41s main