ddrg
/

YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co./docs/hub/model-cards#model-card-metadata)

CodeColBERT

This model serves as the base for our semantic code retrieval system SELMA. It can be applied for indexing and retrieval using the Pyterrier bindings for ColBERT.

Training Details

This model was trained for code retrieval. As a base, CodeBERT is used. It is trained using the official ColBERTv2 code (Github).

Our data source is the CodeSearchNet Challenge. Training ColBERT requires a tripes of queries, positive examples and negative examples. As queries, we used the documentation provided for each sample in the CodeSearchNet data set, while its code snippet serves as the positive example. Negative examples were sampled randomly from the corpus. In total, we train for 400.000 steps.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.