AIM Intelligence

company

https://aim-intelligence.com

AIM-Intelligence

AI & ML interests

AI Safety & AI Security

Recent Activity

Dasool authored a paper 5 days ago

When Context Flips, Safety Breaks: Diagnosing Brittle Safety in Aligned Language Models

ysy970923 updated a model 15 days ago

AIM-Intelligence/Guardian-v0.2-PII-NER-lora-v4

ysy970923 updated a model 15 days ago

AIM-Intelligence/Guardian-v0.2-PII-extract-lora-v1

Papers

XL-SafetyBench: A Country-Grounded Cross-Cultural Benchmark for LLM Safety and Cultural Sensitivity

COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs

authored a paper 5 days ago

When Context Flips, Safety Breaks: Diagnosing Brittle Safety in Aligned Language Models

Paper • 2605.27851 • Published 15 days ago • 1

updated 3 models 15 days ago

AIM-Intelligence/Guardian-v0.2-PII-NER-lora-v4

Updated 15 days ago

AIM-Intelligence/Guardian-v0.2-PII-extract-lora-v1

Updated 15 days ago

AIM-Intelligence/Guardian-v0.2-PII-name-address-lora-v3

Updated 15 days ago

published 3 models 15 days ago

AIM-Intelligence/Guardian-v0.2-PII-NER-lora-v4

Updated 15 days ago

AIM-Intelligence/Guardian-v0.2-PII-extract-lora-v1

Updated 15 days ago

AIM-Intelligence/Guardian-v0.2-PII-name-address-lora-v3

Updated 15 days ago

updated a dataset 15 days ago

AIM-Intelligence/XL-SafetyBench

Updated 15 days ago • 5.5k • 522 • 6

updated a dataset 28 days ago

AIM-Intelligence/fluid-reasoning-representation-phase1

Updated 28 days ago • 141

published a dataset 29 days ago

AIM-Intelligence/fluid-reasoning-representation-phase1

Updated 28 days ago • 141

authored a paper about 1 month ago

XL-SafetyBench: A Country-Grounded Cross-Cultural Benchmark for LLM Safety and Cultural Sensitivity

Paper • 2605.05662 • Published May 7 • 11

updated a collection about 1 month ago

XL-SafetyBench

A country-grounded cross-cultural benchmark for LLM safety and cultural sensitivity across 10 country-language pairs. • 2 items • Updated May 8 • 6

submitted a paper to Daily Papers about 1 month ago

XL-SafetyBench: A Country-Grounded Cross-Cultural Benchmark for LLM Safety and Cultural Sensitivity

Paper • 2605.05662 • Published May 7 • 11

authored a paper 5 months ago

What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models

Paper • 2601.06165 • Published Jan 7 • 16

submitted a paper to Daily Papers 5 months ago

What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models

Paper • 2601.06165 • Published Jan 7 • 16

authored 3 papers 5 months ago

Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures

Paper • 2510.24081 • Published Oct 28, 2025 • 24

Distribution-Level Feature Distancing for Machine Unlearning: Towards a Better Trade-off Between Model Utility and Forgetting

Paper • 2409.14747 • Published Sep 23, 2024

COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs

Paper • 2601.01836 • Published Jan 5 • 10

authored a paper 6 months ago

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24, 2025 • 77