SQL-of-Thought: Multi-agentic Text-to-SQL with Guided Error Correction Paper • 2509.00581 • Published 7 days ago • 3
AMBEDKAR-A Multi-level Bias Elimination through a Decoding Approach with Knowledge Augmentation for Robust Constitutional Alignment of Language Models Paper • 2509.02133 • Published 4 days ago • 2
PRvL: Quantifying the Capabilities and Risks of Large Language Models for PII Redaction Paper • 2508.05545 • Published 30 days ago • 2
I Think, Therefore I Am Under-Qualified? A Benchmark for Evaluating Linguistic Shibboleth Detection in LLM Hiring Evaluations Paper • 2508.04939 • Published about 1 month ago • 2
Investigating Hallucination in Conversations for Low Resource Languages Paper • 2507.22720 • Published Jul 30 • 6
AlignGuard-LoRA: Alignment-Preserving Fine-Tuning via Fisher-Guided Decomposition and Riemannian-Geodesic Collision Regularization Paper • 2508.02079 • Published Aug 4 • 2
TRACEALIGN -- Tracing the Drift: Attributing Alignment Failures to Training-Time Belief Sources in LLMs Paper • 2508.02063 • Published Aug 4 • 1
MOD-X: A Modular Open Decentralized eXchange Framework proposal for Heterogeneous Interoperable Artificial Agents Paper • 2507.04376 • Published Jul 6 • 3
RADIANT: Retrieval AugmenteD entIty-context AligNmenT -- Introducing RAG-ability and Entity-Context Divergence Paper • 2507.02949 • Published Jun 28
QuickSilver -- Speeding up LLM Inference through Dynamic Token Halting, KV Skipping, Contextual Token Fusion, and Adaptive Matryoshka Quantization Paper • 2506.22396 • Published Jun 27
Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published Jul 2 • 61
Mental Health Equity in LLMs: Leveraging Multi-Hop Question Answering to Detect Amplified and Silenced Perspectives Paper • 2506.18116 • Published Jun 22
Thinking Beyond Tokens: From Brain-Inspired Intelligence to Cognitive Foundations for Artificial General Intelligence and its Societal Impact Paper • 2507.00951 • Published Jul 1 • 22
Peccavi: Visual Paraphrase Attack Safe and Distortion Free Image Watermarking Technique for AI-Generated Images Paper • 2506.22960 • Published Jun 28 • 6
Alignment Quality Index (AQI) : Beyond Refusals: AQI as an Intrinsic Alignment Diagnostic via Latent Geometry, Cluster Divergence, and Layer wise Pooled Representations Paper • 2506.13901 • Published Jun 16 • 3
AdversariaL attacK sAfety aLIgnment(ALKALI): Safeguarding LLMs through GRACE: Geometric Representation-Aware Contrastive Enhancement- Introducing Adversarial Vulnerability Quality Index (AVQI) Paper • 2506.08885 • Published Jun 10
Human-Readable Adversarial Prompts: An Investigation into LLM Vulnerabilities Using Situational Context Paper • 2412.16359 • Published Dec 20, 2024
Can Large Language Models Infer Causal Relationships from Real-World Text? Paper • 2505.18931 • Published May 25 • 1
Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods Paper • 2505.17870 • Published May 23 • 5