Nasjonalbiblioteket AI Lab

Team

non-profit

Verified

https://ai.nb.no/

NbAiLab

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

pere updated a collection 2 days ago

NB-ASR-BETA

pere published a model 2 days ago

NbAiLab/nb-asr-beta1-Qwen06B-reading-optimised

pere updated a model 2 days ago

NbAiLab/nb-asr-beta1-Qwen06B-reading-optimised

View all activity

NbAiLab 's collections 12

NB-ASR-BETA

Beta testing resources in the NB-ASR project

NbAiLab/nb-asr-beta1-Qwen06B-reading-optimised

Automatic Speech Recognition • 0.8B • Updated 1 day ago • 53 • 2

NB-Whisper

Models based on Whisper from OpenAI, and trained on data from Språkbanken and the digital collection at the National Library of Norway.

NbAiLab/nb-whisper-large-distil-turbo-beta

Automatic Speech Recognition • 0.8B • Updated Sep 10, 2025 • 4.7k • 8
NbAiLab/nb-whisper-large

Automatic Speech Recognition • 2B • Updated Jul 13, 2024 • 8.93k • 37
NbAiLab/nb-whisper-medium

Automatic Speech Recognition • 0.8B • Updated Feb 13, 2024 • 1.28k • 4
NbAiLab/nb-whisper-small

Automatic Speech Recognition • 0.2B • Updated Feb 13, 2024 • 807 • 1

NB-Llama 3.x Quant

Quantized version of the NB-Llama 3.x models. Due to hardware issues, we have still not been able to make quantized version of the 70B models.

NbAiLab/nb-llama-3.2-1B-Q4_K_M-GGUF

Text Generation • 1B • Updated Dec 11, 2024 • 9
NbAiLab/nb-llama-3.2-3B-Q4_K_M-GGUF

Text Generation • 3B • Updated Dec 11, 2024 • 9
NbAiLab/nb-llama-3.1-8B-Q4_K_M-GGUF

Text Generation • 8B • Updated Dec 11, 2024 • 52 • 1
NbAiLab/nb-llama-3.2-1B-Instruct-Q4_K_M-GGUF

Text Generation • 1B • Updated Dec 11, 2024 • 28

NB-Wav2Vec

Models based on Wav2Vec from Meta, and trained on data from Språkbanken and the digital collection at the National Library of Norway.

Boosting Norwegian Automatic Speech Recognition

Paper • 2307.01672 • Published Jul 4, 2023 • 1
NbAiLab/nb-wav2vec2-300m-bokmaal-v2

Automatic Speech Recognition • 0.3B • Updated Oct 14, 2024 • 40.6k
NbAiLab/nb-wav2vec2-300m-bokmaal

Automatic Speech Recognition • 0.3B • Updated Dec 9, 2024 • 937
NbAiLab/nb-wav2vec2-1b-bokmaal-v2

Automatic Speech Recognition • 1.0B • Updated Dec 27, 2024 • 956k

NB-BERT

Models based on BERT from Google, and trained on data from various sources, including the digital collection at the National Library of Norway.

Operationalizing a National Digital Library: The Case for a Norwegian Transformer Model

Paper • 2104.09617 • Published Apr 19, 2021 • 1
NbAiLab/nb-bert-base

Fill-Mask • 0.2B • Updated Sep 7, 2023 • 8.36k • • 31
NbAiLab/nb-bert-large

Fill-Mask • 0.4B • Updated Sep 11, 2023 • 470 • 13

Newspaper Processing

Tools for processing of newspapers: cropping (TBD) and front page detection.

NbAiLab/vit-front-page-384-complete-v2

Image Classification • 86.1M • Updated Feb 4, 2025 • 4
NbAiLab/vit-front-page-384-top-v2

Image Classification • 86.1M • Updated Feb 4, 2025 • 5

🌌 Borealis Preview

Preview release of the Borealis family of instruction tuned models by the National Library of Norway.

NbAiLab/borealis-27b-instruct-preview

Image-Text-to-Text • 27B • Updated Feb 1 • 385 • 5
NbAiLab/borealis-12b-instruct-preview

Image-Text-to-Text • 12B • Updated Feb 1 • 566 • 1
NbAiLab/borealis-4b-instruct-preview

Image-Text-to-Text • 4B • Updated Dec 23, 2025 • 3.61k • 13
NbAiLab/borealis-1b-instruct-preview

Image-Text-to-Text • 1.0B • Updated Feb 1 • 1.23k • 1

NB-NoTraM-Llama 3.x

Llama 3.x models in various sizes.

NbAiLab/nb-notram-llama-3.2-1b-instruct

Text Generation • 1B • Updated Jan 9 • 1.11k • 1
NbAiLab/nb-notram-llama-3.2-3b-instruct

Text Generation • 3B • Updated Jan 9 • 915 • 2
NbAiLab/nb-notram-llama-3.1-8b-instruct

Text Generation • 8B • Updated Jan 9 • 57 • 2
NbAiLab/nb-notram-llama-3.3-70b-instruct

Text Generation • 71B • Updated Jan 9 • 35

NB-Whisper-verbatim

NB-Whisper models that are mostly suited for linguists and researchers. The output is lowercase and without punctation.

NbAiLab/nb-whisper-large-verbatim

Automatic Speech Recognition • 2B • Updated Feb 13, 2024 • 64 • 2
NbAiLab/nb-whisper-medium-verbatim

Automatic Speech Recognition • 0.8B • Updated Feb 13, 2024 • 30
NbAiLab/nb-whisper-small-verbatim

Automatic Speech Recognition • 0.2B • Updated Feb 13, 2024 • 16
NbAiLab/nb-whisper-base-verbatim

Automatic Speech Recognition • 72.6M • Updated Feb 13, 2024 • 11

NB-GPT-J

Models based on GPT-J from EleutherAI, and trained on data from various sources, including the digital collection at the National Library of Norway.

NbAiLab/nb-gpt-j-6B

Text Generation • 6B • Updated Sep 20, 2023 • 54 • 21
NbAiLab/nb-gpt-j-6B-v2

Text Generation • 6B • Updated Nov 21, 2023 • 113 • 5
NbAiLab/nb-gpt-j-6B-alpaca

Text Generation • 6B • Updated Sep 20, 2023 • 66 • 2
NbAiLab/nb-gpt-j-6B-norpaca

Text Generation • Updated Sep 20, 2023 • 6

Speech datasets

Speech data for our speech to text models

NbAiLab/NST

Updated May 20, 2025 • 156 • 4
NbAiLab/NPSC

Updated Aug 14, 2024 • 1.5k • 9
NbAiLab/NST_hesitate

Updated Sep 10, 2024 • 13

NB-Whisper Beta

Models based on Whisper from OpenAI, and trained on data from Språkbanken and the digital collection at the National Library of Norway.

NbAiLab/nb-whisper-tiny-beta

Automatic Speech Recognition • 37.8M • Updated Jul 24, 2023 • 14 • 1
NbAiLab/nb-whisper-base-beta

Automatic Speech Recognition • 72.6M • Updated Jul 24, 2023 • 16 • 1
NbAiLab/nb-whisper-small-beta

Automatic Speech Recognition • 0.2B • Updated Jul 23, 2023 • 27 • 15
NbAiLab/nb-whisper-medium-beta

Automatic Speech Recognition • 0.8B • Updated Jul 24, 2023 • 14 • 2

NB-ASR-BETA

Beta testing resources in the NB-ASR project

NbAiLab/nb-asr-beta1-Qwen06B-reading-optimised

Automatic Speech Recognition • 0.8B • Updated 1 day ago • 53 • 2

🌌 Borealis Preview

Preview release of the Borealis family of instruction tuned models by the National Library of Norway.

NbAiLab/borealis-27b-instruct-preview

Image-Text-to-Text • 27B • Updated Feb 1 • 385 • 5
NbAiLab/borealis-12b-instruct-preview

Image-Text-to-Text • 12B • Updated Feb 1 • 566 • 1
NbAiLab/borealis-4b-instruct-preview

Image-Text-to-Text • 4B • Updated Dec 23, 2025 • 3.61k • 13
NbAiLab/borealis-1b-instruct-preview

Image-Text-to-Text • 1.0B • Updated Feb 1 • 1.23k • 1

NB-Whisper

Models based on Whisper from OpenAI, and trained on data from Språkbanken and the digital collection at the National Library of Norway.

NbAiLab/nb-whisper-large-distil-turbo-beta

Automatic Speech Recognition • 0.8B • Updated Sep 10, 2025 • 4.7k • 8
NbAiLab/nb-whisper-large

Automatic Speech Recognition • 2B • Updated Jul 13, 2024 • 8.93k • 37
NbAiLab/nb-whisper-medium

Automatic Speech Recognition • 0.8B • Updated Feb 13, 2024 • 1.28k • 4
NbAiLab/nb-whisper-small

Automatic Speech Recognition • 0.2B • Updated Feb 13, 2024 • 807 • 1

NB-NoTraM-Llama 3.x

Llama 3.x models in various sizes.

NbAiLab/nb-notram-llama-3.2-1b-instruct

Text Generation • 1B • Updated Jan 9 • 1.11k • 1
NbAiLab/nb-notram-llama-3.2-3b-instruct

Text Generation • 3B • Updated Jan 9 • 915 • 2
NbAiLab/nb-notram-llama-3.1-8b-instruct

Text Generation • 8B • Updated Jan 9 • 57 • 2
NbAiLab/nb-notram-llama-3.3-70b-instruct

Text Generation • 71B • Updated Jan 9 • 35

NB-Llama 3.x Quant

Quantized version of the NB-Llama 3.x models. Due to hardware issues, we have still not been able to make quantized version of the 70B models.

NbAiLab/nb-llama-3.2-1B-Q4_K_M-GGUF

Text Generation • 1B • Updated Dec 11, 2024 • 9
NbAiLab/nb-llama-3.2-3B-Q4_K_M-GGUF

Text Generation • 3B • Updated Dec 11, 2024 • 9
NbAiLab/nb-llama-3.1-8B-Q4_K_M-GGUF

Text Generation • 8B • Updated Dec 11, 2024 • 52 • 1
NbAiLab/nb-llama-3.2-1B-Instruct-Q4_K_M-GGUF

Text Generation • 1B • Updated Dec 11, 2024 • 28

NB-Whisper-verbatim

NB-Whisper models that are mostly suited for linguists and researchers. The output is lowercase and without punctation.

NbAiLab/nb-whisper-large-verbatim

Automatic Speech Recognition • 2B • Updated Feb 13, 2024 • 64 • 2
NbAiLab/nb-whisper-medium-verbatim

Automatic Speech Recognition • 0.8B • Updated Feb 13, 2024 • 30
NbAiLab/nb-whisper-small-verbatim

Automatic Speech Recognition • 0.2B • Updated Feb 13, 2024 • 16
NbAiLab/nb-whisper-base-verbatim

Automatic Speech Recognition • 72.6M • Updated Feb 13, 2024 • 11

NB-Wav2Vec

Models based on Wav2Vec from Meta, and trained on data from Språkbanken and the digital collection at the National Library of Norway.

Boosting Norwegian Automatic Speech Recognition

Paper • 2307.01672 • Published Jul 4, 2023 • 1
NbAiLab/nb-wav2vec2-300m-bokmaal-v2

Automatic Speech Recognition • 0.3B • Updated Oct 14, 2024 • 40.6k
NbAiLab/nb-wav2vec2-300m-bokmaal

Automatic Speech Recognition • 0.3B • Updated Dec 9, 2024 • 937
NbAiLab/nb-wav2vec2-1b-bokmaal-v2

Automatic Speech Recognition • 1.0B • Updated Dec 27, 2024 • 956k

NB-GPT-J

Models based on GPT-J from EleutherAI, and trained on data from various sources, including the digital collection at the National Library of Norway.

NbAiLab/nb-gpt-j-6B

Text Generation • 6B • Updated Sep 20, 2023 • 54 • 21
NbAiLab/nb-gpt-j-6B-v2

Text Generation • 6B • Updated Nov 21, 2023 • 113 • 5
NbAiLab/nb-gpt-j-6B-alpaca

Text Generation • 6B • Updated Sep 20, 2023 • 66 • 2
NbAiLab/nb-gpt-j-6B-norpaca

Text Generation • Updated Sep 20, 2023 • 6

NB-BERT

Models based on BERT from Google, and trained on data from various sources, including the digital collection at the National Library of Norway.

Operationalizing a National Digital Library: The Case for a Norwegian Transformer Model

Paper • 2104.09617 • Published Apr 19, 2021 • 1
NbAiLab/nb-bert-base

Fill-Mask • 0.2B • Updated Sep 7, 2023 • 8.36k • • 31
NbAiLab/nb-bert-large

Fill-Mask • 0.4B • Updated Sep 11, 2023 • 470 • 13

Speech datasets

Speech data for our speech to text models

NbAiLab/NST

Updated May 20, 2025 • 156 • 4
NbAiLab/NPSC

Updated Aug 14, 2024 • 1.5k • 9
NbAiLab/NST_hesitate

Updated Sep 10, 2024 • 13

Newspaper Processing

Tools for processing of newspapers: cropping (TBD) and front page detection.

NbAiLab/vit-front-page-384-complete-v2

Image Classification • 86.1M • Updated Feb 4, 2025 • 4
NbAiLab/vit-front-page-384-top-v2

Image Classification • 86.1M • Updated Feb 4, 2025 • 5

NB-Whisper Beta

Models based on Whisper from OpenAI, and trained on data from Språkbanken and the digital collection at the National Library of Norway.

NbAiLab/nb-whisper-tiny-beta

Automatic Speech Recognition • 37.8M • Updated Jul 24, 2023 • 14 • 1
NbAiLab/nb-whisper-base-beta

Automatic Speech Recognition • 72.6M • Updated Jul 24, 2023 • 16 • 1
NbAiLab/nb-whisper-small-beta

Automatic Speech Recognition • 0.2B • Updated Jul 23, 2023 • 27 • 15
NbAiLab/nb-whisper-medium-beta

Automatic Speech Recognition • 0.8B • Updated Jul 24, 2023 • 14 • 2

AI & ML interests

Recent Activity

Team members 31

NbAiLab 's collections 12