---
datasets:
- OpenLLM-France/Lucie-Training-Dataset
- permutans/wdc-common-crawl-embedded-jsonld
- calabi-yau-data/ws-5d
- mathmadness/MathCoder
- AlgorithmicResearchGroup/arxiv_research_code
- Vikhrmodels/programming_books_eng
- cornstack/cornstack-python-v1
- HuggingFaceH4/MATH-500
- di-zhang-fdu/MATH500
- bezir/MATH-500-multilingual
- cais/mmlu
- TIGER-Lab/MMLU-Pro
- CohereForAI/Global-MMLU
- minimario/livecodebench-execute-v2
- akioi/livecodebench_1_to_4
- livecodebench/execution
- MatrixStudio/Codeforces-Python-Submissions
- DenCT/codeforces-problems-7k
- evanellis/Codeforces-LLM-Generations
- evanellis/Codeforces-LLM-Generations_with_completions
- Maxwell-Jia/AIME_2024
- HuggingFaceH4/aime_2024
language:
- en
- fr
- ar
metrics:
- accuracy
pipeline_tag: text-classification
---
---

# EGen V1

## 🎯 Performance Metrics

| **Category**  | **Performance** | **Details**                   |
|---------------|-----------------|-------------------------------|
| Architecture  | THL-150         | 14B Active Parameters         |
| MMLU Score    | 72.3%           | Knowledge & Reasoning         |
| MATH-500      | 92.3%           | Advanced Mathematics          |
| LiveCodeBench | 65.1%           | Code Generation               |
| Inference     | 30-70 ms/token  | RTX 3070 GPU, 4-bit quantized |

---

## 🛠️ Technical Stack

### Core Infrastructure

- **Hardware**: RTX 3070 GPU, Google Cloud TPUs
- **Architecture**: Custom THL-150 Transformer
- **Training**: Distributed PyTorch FSDP, Horovod
- **Optimization**: DeepSpeed, Flash Attention, LoRA

### Development Tools

- **ML Frameworks**: ![PyTorch](https://img.shields.io/badge/PyTorch-EE4C2C?style=flat-square&logo=pytorch&logoColor=white) ![TensorFlow](https://img.shields.io/badge/TensorFlow-FF6F00?style=flat-square&logo=tensorflow&logoColor=white)
- **Deployment**: ![ONNX](https://img.shields.io/badge/ONNX-005CED?style=flat-square&logo=onnx&logoColor=white) ![TensorRT](https://img.shields.io/badge/TensorRT-76B900?style=flat-square&logo=nvidia&logoColor=white) ![FastAPI](https://img.shields.io/badge/FastAPI-009688?style=flat-square&logo=fastapi&logoColor=white)
- **Data Tools**: ![Pandas](https://img.shields.io/badge/Pandas-150458?style=flat-square&logo=pandas&logoColor=white) ![NumPy](https://img.shields.io/badge/NumPy-013243?style=flat-square&logo=numpy&logoColor=white) ![HuggingFace](https://img.shields.io/badge/HuggingFace-FFD43B?style=flat-square&logo=huggingface&logoColor=white)
- **Monitoring**: ![W&B](https://img.shields.io/badge/W&B-FFBE00?style=flat-square&logo=weightsandbiases&logoColor=white) ![MLflow](https://img.shields.io/badge/MLflow-0194E2?style=flat-square&logo=mlflow&logoColor=white)
- **Infrastructure**: ![Docker](https://img.shields.io/badge/Docker-2496ED?style=flat-square&logo=docker&logoColor=white) ![Kubernetes](https://img.shields.io/badge/Kubernetes-326CE5?style=flat-square&logo=kubernetes&logoColor=white) ![Google Cloud](https://img.shields.io/badge/Google%20Cloud-4285F4?style=flat-square&logo=google-cloud&logoColor=white)

---

## ✨ Key Features

- 🧠 **Advanced NLP**: Context-aware processing, multi-language support.
- 🚀 **High-Speed Inference**: 30-70 ms/token on RTX 3070.
- 🔄 **Zero-Shot Learning**: Exceptional task adaptation.
- 🛡️ **Enterprise Security**: Model watermarking & governance.
- 🌐 **Multi-Modal Support**: Text, code, structured data.

---

## 📦 Resources & Support

### Documentation

- [📖 Technical Docs](https://huggingface.co/ErebusTN/EGen_V1/blob/main/Documentation.md)

### Licensing

- **Research & Academic Use**: Free with attribution.
- **Commercial Use**: Requires a license. Contact [mouhebga62@gmail.com](mailto:mouhebga62@gmail.com).
- **Modifications**: Allowed with proper attribution.

---

## 📜 License

### EGen V1 License

The EGen V1 model is licensed under the following terms:

- **Research & Academic Use**: Free for non-commercial use with proper attribution.
- **Commercial Use**: Requires a license. Contact [mouhebga62@gmail.com](mailto:mouhebga62@gmail.com) for licensing details.
- **Modifications**: Permitted, but derivative works must include the original license and attribution.

#### Key Restrictions:

- Redistribution of the model or its weights is prohibited without explicit permission.
- Use in malicious or harmful applications is strictly forbidden.

---
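
To put the inference figures from the Performance Metrics table in more familiar terms, the sketch below converts the stated 30-70 ms/token latency into tokens per second and gives a rough estimate of weight memory for 14B parameters at 4 bits each. The function names are illustrative only and are not part of any EGen tooling. The memory estimate assumes that the 14B figure covers all weights that must be resident and ignores the KV cache, activations, and quantization overhead, so real usage will likely be higher.

```python
def tokens_per_second(ms_per_token: float) -> float:
    """Convert per-token latency in milliseconds to throughput in tokens/s."""
    return 1000.0 / ms_per_token


def weight_memory_gib(n_params: float, bits_per_weight: int) -> float:
    """Rough size of the weights alone, in GiB (assumption: no overhead)."""
    total_bytes = n_params * bits_per_weight / 8
    return total_bytes / 2**30


if __name__ == "__main__":
    # Latency range quoted in the model card: 30-70 ms/token.
    fast = tokens_per_second(30)
    slow = tokens_per_second(70)
    print(f"Throughput: {slow:.1f} to {fast:.1f} tokens/s")

    # 14B parameters at 4 bits per weight (figures from the model card).
    print(f"Weights only: {weight_memory_gib(14e9, 4):.2f} GiB")
```

Under these assumptions the quoted latency corresponds to roughly 14 to 33 tokens per second, and the 4-bit weights alone come to about 6.5 GiB.
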
**EGen V1 - Advancing AI Technology** [Documentation](https://huggingface.co/ErebusTN/EGen_V1/blob/main/Documentation.md)