DJ Sri Vigneshwar's picture

DJ Sri Vigneshwar

Sri-Vigneshwar-DJ

AI & ML interests

Currently building Hawky.ai - Creative Intelligence for Performance Marketing

Recent Activity

Organizations

AI FILMS's profile picture GEM benchmark's profile picture MusicAI's profile picture OpenVINO Toolkit's profile picture Open-Source AI Meetup's profile picture East China Normal University's profile picture AI Zero to Hero's profile picture Stable Diffusion Dreambooth Concepts Library's profile picture Blog-explorers's profile picture AI Tamil Nadu's profile picture LocalLLaMA's profile picture MLX Community's profile picture C4AI Community's profile picture M4-ai's profile picture Chinese LLMs on Hugging Face's profile picture Paris AI Running Club's profile picture Hawky.ai - The Creative Analytics Platform's profile picture Hawky.ai - Fine-tuned Language and Creative Generation Models (MarTech) 's profile picture Intelligent Estate's profile picture open/ acc's profile picture Data Is Better Together Contributor's profile picture Arracle AI's profile picture

Sri-Vigneshwar-DJ's activity

posted an update about 1 month ago
view post
Post
687
Checkout phi-4 from Microsoft, dropped a day ago... If you ❤️ the Phi series, then here is the GGUF - Sri-Vigneshwar-DJ/phi-4-GGUF. phi-4 is a 14B highly efficient open LLM that beats much larger models at math and reasoning - check out evaluations on the Open LLM.

Technical paper - https://arxiv.org/pdf/2412.08905 ; The Data Synthesis approach is interesting
reacted to cfahlgren1's post with 🔥 about 1 month ago
view post
Post
1759
Wow, I just added Langfuse tracing to the Deepseek Artifacts app and it's really nice 🔥

It allows me to visualize and track more things along with the cfahlgren1/react-code-instructions dataset.

It was just added as a one click Docker Space template, so it's super easy to self host 💪
posted an update about 2 months ago
view post
Post
2080
Just sharing a thought: I started using DeepSeek V3 a lot, and an idea struck me about agents "orchestrating during inference" on a test-time compute model like DeepSeek V3 or the O1 series.

Agents (Instruction + Function Calls + Memory) execute during inference, and based on the output decision, a decision is made to scale the time to reason or perform other tasks.
posted an update about 2 months ago
view post
Post
2348
Combining smolagents with Anthropic’s best practices simplifies building powerful AI agents:

1. Code-Based Agents: Write actions as Python code, reducing steps by 30%.
2. Prompt Chaining: Break tasks into sequential subtasks with validation gates.
3. Routing: Classify inputs and direct them to specialized handlers.
4. Fallback: Handle tasks even if classification fails.

https://huggingface.co/blog/Sri-Vigneshwar-DJ/building-effective-agents-with-anthropics-best-pra