Skip to content

Issues: databricks/megablocks

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Assignee
Filter by who’s assigned
Sort

Issues list

Bad throughput with GLU
#110 opened May 17, 2024 by Muennighoff
1-expert worse than dense model
#107 opened May 8, 2024 by Muennighoff
Add a fine-tune script for JetMoE
#105 opened Apr 17, 2024 by shamanez
ScatterMoE feature
#104 opened Apr 5, 2024 by ehartford
support amd/rocm enhancement New feature or request help wanted Extra attention is needed
#97 opened Mar 21, 2024 by ehartford
AMP + BF16 failing
#95 opened Jan 28, 2024 by jramapuram
selective router precision question Further information is requested
#91 opened Jan 14, 2024 by 152334H
Does this framework support SFT? question Further information is requested
#90 opened Jan 12, 2024 by banksy23
RuntimeError: Triton Error [CUDA]: invalid argument question Further information is requested
#88 opened Jan 10, 2024 by noob-ctrl
How to integrate to transformers-based mixtral question Further information is requested
#84 opened Jan 3, 2024 by nxphi47
ParallelDroplessMLP initialises self.mlp twice enhancement New feature or request help wanted Extra attention is needed
#83 opened Jan 1, 2024 by 152334H
Script for Full Fine-Tuning of Mixtral question Further information is requested
#68 opened Dec 20, 2023 by alpayariyak
ProTip! Adding no:label will show everything without a label.