Tokenizer change has strange order plus format
2
#21 opened 11 months ago
by
Qubitium

Training DBRX-like model
3
#13 opened 11 months ago
by
nguyenthanhdo

Release weights of smaller Experimental MoE
2
#12 opened 11 months ago
by
shahules786

Train datasets?
1
#11 opened 11 months ago
by
danielpark

Errors During Training for the Original Implementation and the Fixes for the Errors
#10 opened 11 months ago
by
v2ray

Performance of DBRX in TEXT SUMMARIZATION and GRAMMAR CORRECTION
1
#9 opened 11 months ago
by
CharanAI

Please, authorize access for the base weight!
44
#5 opened 11 months ago
by
Undi95
