Update README.md
README.md CHANGED
@@ -35,7 +35,7 @@ Based on growth technology, the Tele-FLM-1T model training is divided into three
 - SwiGLU for activation function
 - Linear bias disabled
 - Embedding and language model head untied
-- Input and output
+- Input and output multiplier

 Consequently, Tele-FLM-1T is largely compatible with Llama architecturally.
 To maximize convenience for the community, we made minimal adjustments to Llama's code to adapt it to Tele-FLM and released it as open source.
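The architectural choices listed in the diff (SwiGLU feed-forward, no linear bias, untied embedding and LM head, input and output multipliers) can be sketched minimally. This is an illustrative NumPy sketch, not Tele-FLM's actual code: all class and parameter names here are assumptions, and the multipliers are shown as simple scalars applied to the embedding output and the logits.

```python
import numpy as np

def silu(x):
    # SiLU (swish) activation: x * sigmoid(x)
    return x / (1.0 + np.exp(-x))

def swiglu_ffn(x, w_gate, w_up, w_down):
    # SwiGLU feed-forward: down( SiLU(gate(x)) * up(x) ).
    # Linear bias is disabled, so each projection is a plain matmul.
    return (silu(x @ w_gate) * (x @ w_up)) @ w_down

class TinyDecoderSketch:
    """Hypothetical one-block decoder illustrating the listed choices."""

    def __init__(self, vocab, d_model, d_ff, input_mult=1.0, output_mult=1.0, seed=0):
        rng = np.random.default_rng(seed)
        # Embedding and LM head are separate (untied) weight matrices.
        self.embed = rng.standard_normal((vocab, d_model)) * 0.02
        self.head = rng.standard_normal((d_model, vocab)) * 0.02
        self.w_gate = rng.standard_normal((d_model, d_ff)) * 0.02
        self.w_up = rng.standard_normal((d_model, d_ff)) * 0.02
        self.w_down = rng.standard_normal((d_ff, d_model)) * 0.02
        self.input_mult = input_mult
        self.output_mult = output_mult

    def forward(self, token_ids):
        h = self.embed[token_ids] * self.input_mult   # input multiplier
        h = h + swiglu_ffn(h, self.w_gate, self.w_up, self.w_down)
        return (h @ self.head) * self.output_mult     # output multiplier

model = TinyDecoderSketch(vocab=10, d_model=4, d_ff=8)
logits = model.forward(np.array([1, 2, 3]))
print(logits.shape)  # (3, 10): one logit row per input token
```

Because the projections carry no bias and the head is untied from the embedding, a Llama-style implementation needs only small adjustments (the multipliers) to match this layout, which is the compatibility point the README makes.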