A powerful and versatile family of Arabic Large Language Models (LLMs) designed for a wide range of tasks.
Papers
Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA
From RAG to Agentic RAG for Faithful Islamic Question Answering
This collection focuses on Islamic religious resources, Islamic media ethics and other relevant content.
AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs
This collection contains resources and a specialized family of models for analyzing news and social media content in a multilingual context.
- QCRI/LlamaLens-Arabic
  Viewer • Updated • 1.17M • 451 • 2
- QCRI/LlamaLens-English
  Viewer • Updated • 1.39M • 617 • 3
- QCRI/LlamaLens-Hindi
  Viewer • Updated • 140k • 165 • 1
- LlamaLens: Specialized Multilingual LLM for Analyzing News and Social Media Content
  Paper • 2410.15308 • Published • 3
MultimodalXplain is a collection of resources and tools focused on the explanation and interpretation of text, speech and multimodal AI models.
Models for the paper entitled: "LLMxCPG: Context-Aware Vulnerability Detection Through Code Property Graph-Guided Large Language Models"
Material for our paper entitled: "TechniqueRAG: Retrieval Augmented Generation for Adversarial Technique Annotation in Cyber Threat Intelligence Text"
HumAID: Human-Annotated Disaster Incidents Data from Twitter with Deep Learning Benchmarks
Datasets and models for factuality, check-worthiness, media bias, propaganda/persuasion, and hate speech across text, image, and multimodal LLM settings.
A curated collection of speech resources, including datasets, code, models, and more, designed to support speech processing research and development.
Datasets and models for our paper: "From Text to Actionable Intelligence: Automating STIX Entity and Relationship Extraction"
A series of models used for the paper entitled: "Semantic Ranking for Automated Adversarial Technique Annotation in Security Text"
- QCRI/SentSecBert_10k
  Sentence Similarity • Updated • 45 • 2
- QCRI/SentSecBert_10k_AllDataSplit
  Sentence Similarity • Updated • 4
- QCRI/monot5_AllDataSplit
  Sentence Similarity • Updated • 2
- Semantic Ranking for Automated Adversarial Technique Annotation in Security Text
  Paper • 2403.17068 • Published
CrisisBench: Benchmarking Crisis-related Social Media Datasets for Humanitarian Information Processing