Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
Phi-4-multimodal-instruct
like
1.55k
Follow
Microsoft
17k
Automatic Speech Recognition
Transformers
Safetensors
24 languages
phi4mm
text-generation
nlp
code
audio
speech-summarization
speech-translation
visual-question-answering
phi-4-multimodal
phi
phi-4-mini
custom_code
arxiv:
2503.01743
arxiv:
2407.13833
License:
mit
Model card
Files
Files and versions
xet
Community
86
Deploy
Use this model
bd4b39b
Phi-4-multimodal-instruct
12.9 GB
14 contributors
History:
3 commits
nguyenbh
Add examples
bd4b39b
10 months ago
examples
Add examples
10 months ago
figures
Added model files
10 months ago
speech-lora
Added model files
10 months ago
vision-lora
Added model files
10 months ago
.gitattributes
Safe
1.57 kB
Added model files
10 months ago
CODE_OF_CONDUCT.md
Safe
444 Bytes
Added model files
10 months ago
LICENSE
Safe
1.14 kB
Added model files
10 months ago
README.md
51.7 kB
Added model files
10 months ago
SECURITY.md
Safe
2.66 kB
Added model files
10 months ago
SUPPORT.md
Safe
1.24 kB
Added model files
10 months ago
added_tokens.json
Safe
249 Bytes
Added model files
10 months ago
config.json
Safe
4.63 kB
Added model files
10 months ago
configuration_phi4mm.py
Safe
11 kB
Added model files
10 months ago
generation_config.json
Safe
190 Bytes
Added model files
10 months ago
merges.txt
Safe
2.42 MB
Added model files
10 months ago
model-00001-of-00003.safetensors
Safe
5 GB
xet
Added model files
10 months ago
model-00002-of-00003.safetensors
Safe
4.95 GB
xet
Added model files
10 months ago
model-00003-of-00003.safetensors
Safe
1.2 GB
xet
Added model files
10 months ago
model.safetensors.index.json
Safe
240 kB
Added model files
10 months ago
modeling_phi4mm.py
Safe
116 kB
Added model files
10 months ago
preprocessor_config.json
Safe
482 Bytes
Added model files
10 months ago
processing_phi4mm.py
Safe
32.8 kB
Added model files
10 months ago
processor_config.json
Safe
121 Bytes
Added model files
10 months ago
sample_finetune_speech.py
Safe
16.7 kB
Added model files
10 months ago
sample_finetune_vision.py
Safe
19.6 kB
Added model files
10 months ago
sample_inference_phi4mm.py
Safe
10.5 kB
Added model files
10 months ago
special_tokens_map.json
Safe
473 Bytes
Added model files
10 months ago
speech_conformer_encoder.py
Safe
111 kB
Added model files
10 months ago
tokenizer.json
Safe
15.5 MB
xet
Added model files
10 months ago
tokenizer_config.json
Safe
3.25 kB
Added model files
10 months ago
vision_siglip_navit.py
Safe
78.2 kB
Added model files
10 months ago
vocab.json
Safe
3.91 MB
Added model files
10 months ago