chuuhtetnaing/whisper-tiny-myanmar

Automatic Speech Recognition

Generated from Trainer


chuuhtetnaing committed on Aug 31, 2024

Commit cbe5f88 · verified · 1 Parent(s): a65cac0

Update README.md

Files changed (1)

README.md +21 -11


@@ -8,6 +8,12 @@ metrics:
 model-index:
 - name: whisper-tiny-myanmar
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -15,24 +21,28 @@ should probably proofread and complete it, then remove this comment. -->
 # whisper-tiny-myanmar
-This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the None dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.2353
 - Wer: 61.8878
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters
@@ -88,4 +98,4 @@ The following hyperparameters were used during training:
 - Transformers 4.35.2
 - Pytorch 2.1.1+cu121
 - Datasets 2.14.5
-- Tokenizers 0.15.1

 model-index:
 - name: whisper-tiny-myanmar
   results: []
+datasets:
+- chuuhtetnaing/myanmar-speech-dataset-openslr-80
+language:
+- my
+pipeline_tag: automatic-speech-recognition
+library_name: transformers
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # whisper-tiny-myanmar
+This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the [chuuhtetnaing/myanmar-speech-dataset-openslr-80](https://huggingface.co/datasets/chuuhtetnaing/myanmar-speech-dataset-openslr-80) dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.2353
 - Wer: 61.8878
+## Usage
+```python
+from datasets import Audio, load_dataset
+from transformers import pipeline
+# Load a sample audio
+dataset = load_dataset("chuuhtetnaing/myanmar-speech-dataset-openslr-80")
+dataset = dataset.cast_column("audio", Audio(sampling_rate=16000))
+test_dataset = dataset['test']
+input_speech = test_dataset[42]['audio']
+pipe = pipeline(model='chuuhtetnaing/whisper-tiny-myanmar')
+output = pipe(input_speech, generate_kwargs={"language": "myanmar", "task": "transcribe"})
+print(output['text']) # ကျွန်မ ပြည်ပ မှာ ပညာ သင် တော့ စာမြီးပွဲ ကို တပတ်တခါ စစ်တယ်
+```
 ### Training hyperparameters
 - Transformers 4.35.2
 - Pytorch 2.1.1+cu121
 - Datasets 2.14.5
+- Tokenizers 0.15.1
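
The model card reports `Wer: 61.8878` without saying how the number is computed. As a rough illustration only, word error rate is usually the word-level edit distance between a reference transcript and the model output, divided by the number of reference words. The sketch below assumes whitespace-separated tokens and a percentage scale (which the reported value suggests but the card does not confirm); `word_error_rate` is a hypothetical helper, not the evaluation code used by the Trainer.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a percentage (assumed scale), using whitespace tokens."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1        # reference word missing from hypothesis
            insertion = curr[j - 1] + 1   # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return 100.0 * prev[-1] / len(ref)


# One substitution and one deletion against a four-word reference.
print(word_error_rate("a b c d", "a x c"))
```

On this scale a value above 60 would mean that more than half of the reference words need an edit, so how the transcripts are segmented into tokens matters a great deal when comparing scores for Burmese text.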