mjtechguy/phi-4-multimodal-instruct
Automatic Speech Recognition
Transformers
Safetensors
24 languages
phi4mm
text-generation
nlp
code
audio
speech-summarization
speech-translation
visual-question-answering
phi-4-multimodal
phi
phi-4-mini
custom_code
arxiv:2407.13833
License: mit
Branch: main
2 contributors
History: 1 commit
mjtechguy, gargamit: Duplicate from microsoft/Phi-4-multimodal-instruct (df2cb0d, verified, about 23 hours ago)
Last commit for every entry below: Duplicate from microsoft/Phi-4-multimodal-instruct, about 23 hours ago

Directories
examples
figures
speech-lora
vision-lora

Files
Name                                 Scan   Size        Storage
.gitattributes                       Safe   1.61 kB
CODE_OF_CONDUCT.md                   Safe   444 Bytes
LICENSE                              Safe   1.14 kB
README.md                            Safe   54.5 kB
SECURITY.md                          Safe   2.66 kB
SUPPORT.md                           Safe   1.24 kB
added_tokens.json                    Safe   249 Bytes
config.json                          Safe   4.63 kB
configuration_phi4mm.py              Safe   11 kB
generation_config.json               Safe   190 Bytes
merges.txt                           Safe   2.42 MB
model-00001-of-00003.safetensors     Safe   5 GB        LFS
model-00002-of-00003.safetensors     Safe   4.95 GB     LFS
model-00003-of-00003.safetensors     Safe   1.2 GB      LFS
model.safetensors.index.json         Safe   240 kB
modeling_phi4mm.py                   Safe   116 kB
phi_4_mm.tech_report.02252025.pdf    Safe   5.3 MB      LFS
preprocessor_config.json             Safe   482 Bytes
processing_phi4mm.py                 Safe   32.8 kB
processor_config.json                Safe   121 Bytes
sample_finetune_speech.py            Safe   16.7 kB
sample_finetune_vision.py            Safe   19.6 kB
sample_inference_phi4mm.py           Safe   10.5 kB
special_tokens_map.json              Safe   473 Bytes
speech_conformer_encoder.py          Safe   111 kB
tokenizer.json                       Safe   15.5 MB     LFS
tokenizer_config.json                Safe   3.25 kB
vision_siglip_navit.py               Safe   78.2 kB
vocab.json                           Safe   3.91 MB