PleIAs
/

KaribuAI

Text Classification

Safetensors

English

French

deberta

Model card Files Files and versions Community

irenegirard commited on 24 days ago

Commit

1ed25f6

verified ·

1 Parent(s): 28600cd

Update README.md

Browse files

Files changed (1) hide show

README.md +35 -10

README.md CHANGED Viewed

@@ -16,25 +16,50 @@ base_model:
 The Karibu project is a collaboration between pleIAs, Bibliothèque sans frontière (BSF) and Kajou. Our platform delivers comprehensive educational activities across six CEFR proficiency levels (A1 to C2), making quality language learning accessible to all, even in offline environments through microSD card deployment. By combining reading comprehension, interactive exercises, and personalized learning paths, Karibu creates an immersive educational experience that adapts to each learner's needs.
-## Text Classification Model
-Our innovative approach begins with the creation of a rich, diverse corpus of educational content. Drawing from high-quality sources and utilizing advanced AI models, we've developed a sophisticated methodology for generating educational content based on French press articles available online (model to come). Each text undergoes a careful transformation process to create variations suitable for different proficiency levels, ensuring that learners at every stage have access to appropriate, engaging content. This systematic approach allows us to maintain high educational standards while scaling our content library effectively.
 🔍 [Explore the full dataset](https://huggingface.co/datasets/PleIAs/KaribuAI/viewer/default)
-## Cultural Relevance and Ethical Content Curation
-Understanding the importance of cultural context in language learning, we've implemented a robust content filtering system that ensures all materials are not only educationally sound but also culturally sensitive. Our platform covers diverse topics including solidarity, African literature and history, agriculture, tourism, and cross-cultural communication. This careful curation process, powered by Celadon, guarantees that learning materials resonate with our users' experiences and educational needs while maintaining the highest standards of ethical content delivery.
-🤖 [Explore the Celadon model](https://huggingface.co/PleIAs/celadon)
-## Advanced Level Classification
-Our classification system precisely evaluates and assigns appropriate difficulty levels to all educational content. The system utilizes DeBERTa (Decoding-enhanced BERT with Disentangled Attention) to capture the subtle linguistic features that distinguish different CEFR levels, from basic A1 constructions to advanced C2 language use. This precision allows for consistent, reliable assessment and appropriate content delivery, creating a foundational framework for personalized learning experiences.
-## AI-Powered Tutoring Experience
-Karibu transforms traditional language learning through its innovative dual-component system. Each learning block combines two key elements: interactive H5P-formatted exercises (including quizzes, drag-and-drop activities, and multimedia content) and an AI tutoring system for essay evaluation. The AI tutor analyzes written submissions in detail, identifying grammatical errors, suggesting improvements, and providing targeted feedback. By analyzing user performance across both structured exercises and free-form writing, our platform creates personalized learning pathways that adapt to each student's progress. Unlike conventional systems limited to multiple-choice questions and scripted interactions, Karibu offers natural, dynamic learning experiences that develop real-world language skills. Our focus on practical, task-based learning modules ensures that educators can immediately apply their knowledge in real-world teaching contexts, creating a multiplier effect that benefits entire learning communities.
-Karibu not only provides cutting-edge language learning tools but also contributes to the democratization of education in geographically isolated areas. Our commitment to open solutions ensures frugality, transparency, and local adaptability, making Karibu a truly transformative force in language education.

 The Karibu project is a collaboration between pleIAs, Bibliothèque sans frontière (BSF) and Kajou. Our platform delivers comprehensive educational activities across six CEFR proficiency levels (A1 to C2), making quality language learning accessible to all, even in offline environments through microSD card deployment. By combining reading comprehension, interactive exercises, and personalized learning paths, Karibu creates an immersive educational experience that adapts to each learner's needs.
+## Karibu Language Level Classifier
+Karibu is a DeBERTa-based classifier that automatically assigns CEFR language proficiency levels (A1-C2) to French educational content.
+Model Characteristics
+## Architecture: DeBERTa with multi-head classification
+Base Model: PleIAs/celadon
+Model Size: Fine-tuned from DeBERTa-v3-small
+Output: 6 classification levels (A1, A2, B1, B2, C1, C2)
+🤖 [Explore the Celadon model](https://huggingface.co/PleIAs/celadon)
+## Training Details
+Training Data: 9,000 synthetic samples
+Source: French press articles + Wikimedia content
+Processing: Sequential text simplification using an open source model (to come)
+Validation: 1,000 samples per level manually verified by BSF experts
+## Topics Coverage:
+- solidarity, geography, African literature, agriculture, tourism, cultural events, African history, geopolitics, communication
+Topic Filtering: Meta-Llama-3-8B-Instruct for content categorization
+Annotation Method:
 🔍 [Explore the full dataset](https://huggingface.co/datasets/PleIAs/KaribuAI/viewer/default)
+## levels
+Manual verification using CEFR framework criteria
+Statistical validation using Louvain word-level classification
+## Technical Integration
+Deployment: Offline-capable via microSD cards
+Format: H5P-compatible for interactive exercises
+Input Processing: Handles various text types (academic writing, press articles, emails, letters, stories)
+## Collaborators
+PleIAs: Technical development
+Bibliothèque Sans Frontières (BSF): Educational expertise
+Kajou: Distribution platform