Harrier-270M-Amharic

Microsoft Harrier OSS v1 270M fine-tuned on Amharic passage retrieval supervision. This model was introduced in the paper The Multilingual Curse at the Retrieval Layer: Evidence from Amharic.

Code: https://github.com/rasyosef/amharic-neural-ir

Results on Amharic Passage Retrieval Dataset V2

Model R@5 R@10 MRR@10 NDCG@10
microsoft/harrier-oss-v1-270m (zero-shot, prompted) 0.697 0.753 0.576 0.619
This model (fine-tuned) 0.860 0.903 0.760 0.795

Fine-tuning yields a +32.0% relative MRR@10 gain over zero-shot. This is the strongest Amharic-fine-tuned multilingual retriever in the paper.

Evaluation dataset: rasyosef/Amharic-Passage-Retrieval-Dataset-V2

Usage

This fine-tuned model is used without the zero-shot instruction prompt:

from sentence_transformers import SentenceTransformer

model = SentenceTransformer("kiyam/Harrier-270M-Amharic")

queries = ["የኢትዮጵያ ዋና ከተማ የትኛው ናት?"]
passages = ["አዲስ አበባ የኢትዮጵያ ዋና ከተማ ናት።"]

query_embeddings = model.encode(queries, normalize_embeddings=True)
passage_embeddings = model.encode(passages, normalize_embeddings=True)

scores = query_embeddings @ passage_embeddings.T

This model uses Matryoshka embeddings — you can truncate to shorter dimensions (e.g. 256) for faster retrieval at a small quality cost:

query_embeddings = model.encode(queries, normalize_embeddings=True)[:, :256]

Training Details

  • Base model: microsoft/harrier-oss-v1-270m (270M parameters)
  • Training data: yosefw/amharic-news-retrieval-dataset-v2-with-negatives-V2
  • Objective: MultipleNegativesRankingLoss + MatryoshkaLoss (dims: 640, 256)
  • Epochs: 6 | Batch size: 32 | Grad. accum.: 4
  • Learning rate: 4e-5, cosine schedule | Precision: BF16
  • Max sequence length: 512

Citation

@inproceedings{alemneh2026amharicir,
  title     = {The Multilingual Curse at the Retrieval Layer: Evidence from Amharic},
  author    = {Alemneh, Yosef Worku and Mekonnen, Kidist Amde and de Rijke, Maarten},
  booktitle = {Proceedings of the 1st Workshop on Multilinguality in the Era of Large Language Models (MeLLM), ACL 2026},
  year      = {2026},
}
Downloads last month
61
Safetensors
Model size
0.3B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for kiyam/Harrier-270M-Amharic

Finetuned
(8)
this model

Dataset used to train kiyam/Harrier-270M-Amharic

Collection including kiyam/Harrier-270M-Amharic

Paper for kiyam/Harrier-270M-Amharic