Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_2048_epoch_1 Text Generation • 4B • Updated 3 days ago • 25
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_2048_epoch_1 Text Generation • 4B • Updated 3 days ago • 25
Ujan/lts_DeepMath-103K_samples_10000_seq_16384_Qwen3-30B-A3B-Thinking-2507_22_23_24_0.8 Viewer • Updated 29 days ago • 11k • 12
Ujan/lts_DeepMath-103K_samples_10000_seq_16384_Qwen3-30B-A3B-Thinking-2507_22_23_24_0.8 Viewer • Updated 29 days ago • 11k • 12
Ujan/lts_pruned_DeepMath-103K_samples_10000_seq_16384_Qwen3-4B-Thinking-2507_17_18_19_0.5 Viewer • Updated 29 days ago • 11k • 14
Ujan/lts_pruned_DeepMath-103K_samples_10000_seq_16384_Qwen3-4B-Thinking-2507_17_18_19_0.5 Viewer • Updated 29 days ago • 11k • 14
Ujan/lts_pruned_processed_DeepMath-103K_samples_50000_seq_16384_Qwen3-4B-Thinking-2507_sparsity_0.5 Viewer • Updated 30 days ago • 51k • 12
Ujan/lts_pruned_processed_DeepMath-103K_samples_50000_seq_16384_Qwen3-4B-Thinking-2507_sparsity_0.5 Viewer • Updated 30 days ago • 51k • 12
Ujan/lts_pruned_processed_DeepMath-103K_samples_10000_seq_16384_Qwen3-4B-Thinking-2507_sparsity_0.8 Viewer • Updated about 1 month ago • 11k • 10
Ujan/lts_pruned_processed_DeepMath-103K_samples_10000_seq_16384_Qwen3-4B-Thinking-2507_sparsity_0.8 Viewer • Updated about 1 month ago • 11k • 10
Ujan/lts_pruned_processed_DeepMath-103K_samples_10000_seq_16384_Qwen3-4B-Thinking-2507_sparsity_0.5 Viewer • Updated about 1 month ago • 11k • 7
Ujan/lts_pruned_processed_DeepMath-103K_samples_10000_seq_16384_Qwen3-4B-Thinking-2507_sparsity_0.5 Viewer • Updated about 1 month ago • 11k • 7
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_16384_epoch_1 Text Generation • 4B • Updated Nov 26 • 3
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_16384_epoch_1 Text Generation • 4B • Updated Nov 26 • 3