kathiasi committed on
Commit 842bbce · verified · 1 Parent(s): f2542fa

Initiation
.gitattributes CHANGED
@@ -33,3 +33,10 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ sample-audios/sorsamisk_-_115_01_-_Goevten_voestes_biejjie_015_024_s.wav filter=lfs diff=lfs merge=lfs -text
+ sample-audios/sorsamisk_-_7_01_-_Jarkoestidh_028_020.wav filter=lfs diff=lfs merge=lfs -text
+ sample-audios/sorsamisk_-_III_01_-_Giesie_eejehtimmiebiejjieh_015_015.wav filter=lfs diff=lfs merge=lfs -text
+ sample-audios/sorsamisk_goltelidh_jupmelen_rihjke_lea_gietskesne_cd1_mono_022_009.wav filter=lfs diff=lfs merge=lfs -text
+ sample-audios/sorsamisk_goltelidh_jupmelen_rihjke_lea_gietskesne_cd1_mono_030_014.wav filter=lfs diff=lfs merge=lfs -text
+ sample-audios/sorsamisk_goltelidh_jupmelen_rihjke_lea_gietskesne_cd2_009_TRN_019_s.wav filter=lfs diff=lfs merge=lfs -text
+ sample-audios/sorsamisk_goltelidh_jupmelen_rihjke_lea_gietskesne_cd2_009_TRN_019.wav filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,12 +1,47 @@
  ---
- title: Nb Tts Rubric
- emoji: 📉
- colorFrom: pink
- colorTo: pink
+ title: Tts Online Rubric
+ emoji: 🌍
+ colorFrom: indigo
+ colorTo: red
  sdk: gradio
  sdk_version: 5.49.1
  app_file: app.py
  pinned: false
+ license: cc-by-4.0
+ short_description: A rubric for choosing a good TTS voice
  ---

  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
+
+ ## Saving responses to a private Hugging Face dataset
+
+ This project can optionally push saved responses to a private dataset on the Hugging Face Hub. The feature is disabled by default and is only active when the following environment variables are set:
+
+ - `HF_TOKEN` — a Hugging Face access token with dataset write permissions.
+ - `HF_DATASET_ID` — a repo id such as `your-username/my-eval-responses`.
+
+ Setup:
+
+ 1. Create a token at https://huggingface.co/settings/tokens and grant it write permissions for datasets/repos.
+ 2. Export the variables locally before running the app:
+
+ ```bash
+ export HF_TOKEN="hf_...your_token..."
+ export HF_DATASET_ID="your-username/my-eval-responses"
+ ```
+
+ 3. Install dependencies and run the app:
+
+ ```bash
+ pip install -r requirements.txt
+ python app.py
+ # open http://localhost:7860 and submit ratings via the UI
+ ```
+
+ Notes:
+
+ - If the `datasets` or `huggingface_hub` packages are not installed, HF pushing is skipped gracefully and responses are still written to `responses.csv`.
+ - Pushed audio files are stored with Git LFS on the Hub; monitor your account storage and LFS quota.
+ - Each UI save triggers a small push (a single-record commit). For heavy usage, consider batching pushes instead.
+
+ There is also a small test script for programmatically verifying that a single sample row can be pushed to your dataset. See `hf_push_test.py`.
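
The skip-when-unconfigured contract described above can be sketched as follows. This is a hypothetical stand-in for `hf_push_test.py` (whose actual contents are not shown in this commit); the function and row names are illustrative, but the rule — push only when both `HF_TOKEN` and `HF_DATASET_ID` are set — mirrors the README:

```python
import os
from datetime import datetime, timezone

def hf_push_configured(env=None):
    """True only when both HF_TOKEN and HF_DATASET_ID are set (see README)."""
    env = os.environ if env is None else env
    return bool(env.get("HF_TOKEN") and env.get("HF_DATASET_ID"))

def build_test_row():
    """One record in the flat dict shape the app appends to responses.csv."""
    return {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "sample": "test-sample.wav",      # hypothetical sample name
        "annotator": "push-test",
        "comment": "programmatic test row",
    }

if __name__ == "__main__":
    if hf_push_configured():
        print("HF push is configured; a real test would push:", build_test_row())
    else:
        print("HF push skipped: set HF_TOKEN and HF_DATASET_ID first.")
```

Run without the variables set, the script reports the skip rather than failing, matching the app's graceful-degradation behavior.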
README_INSTRUCTIONS.md ADDED
@@ -0,0 +1,37 @@
+ # TTS Online Rubric (Gradio)
+
+ This is a minimal Gradio-based UI for running TTS rubric evaluations. It is designed to be deployed on Hugging Face Spaces or run locally.
+
+ Features:
+ - Browse audio files from `sample-audios/`.
+ - Play reference audio and upload system outputs to compare.
+ - Rate outputs on three sliders (nativeness, naturalness, overall quality) and add comments.
+ - Responses are appended to `responses.csv`.
+
+ How to run locally:
+
+ 1. Create a Python environment and install the requirements:
+
+ ```bash
+ python -m venv .venv
+ source .venv/bin/activate
+ pip install -r requirements.txt
+ ```
+
+ 2. Add your reference audio files under `sample-audios/`.
+
+ 3. Run the app:
+
+ ```bash
+ python app.py
+ ```
+
+ 4. Open the URL shown in the terminal (default: http://127.0.0.1:7860).
+
+ Deploying to Hugging Face Spaces:
+
+ - Create a new Space using the Gradio SDK (Python) and push this repo. Ensure `requirements.txt` is present; the app will run automatically.
+
+ Notes and next steps:
+ - The current UI accepts system outputs via file upload. You may prefer a fixed set of system files stored in a directory for side-by-side playback.
+ - Consider adding authentication or locking to avoid duplicate/corrupt CSV writes under heavy load.
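
The locking idea mentioned in the last note can be sketched as a small helper. This is illustrative, not the app's exact code: it uses the same best-effort advisory `fcntl` lock that `app.py` attempts (POSIX only; on other platforms the write proceeds unlocked):

```python
import csv
import os

def append_row_locked(path, header, row):
    """Append one CSV row, writing the header only when the file is new.

    Tries an advisory fcntl lock (POSIX only); if fcntl is unavailable,
    the write proceeds unlocked, so this is best-effort protection only.
    """
    write_header = not os.path.exists(path)
    with open(path, "a", newline="", encoding="utf-8") as f:
        try:
            import fcntl  # not available on Windows
            fcntl.flock(f.fileno(), fcntl.LOCK_EX)
        except Exception:
            pass
        writer = csv.writer(f)
        if write_header:
            writer.writerow(header)
        writer.writerow(row)
```

For truly concurrent-safe writes under heavy load, a queue or a database would be a sturdier choice than file locks.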
Rubric for choosing a TTS voice.docx ADDED
Binary file (9.11 kB)
app.py ADDED
@@ -0,0 +1,430 @@
+ import gradio as gr
+ import os
+ import csv
+ import fcntl
+ from datetime import datetime
+ import uuid
+ import yaml  # You need to install this: pip install pyyaml
+ import glob
+ import random
+ import json
+ import pandas as pd
+ import io
+
+ # --- Hugging Face Functionality Notes ---
+ # To save results to a private Hugging Face dataset, you must:
+ # 1. Install the required libraries: pip install huggingface_hub datasets
+ # 2. Set the following environment variables before running the script:
+ #    - HF_TOKEN: Your Hugging Face access token with write permissions.
+ #    - HF_DATASET_ID: The ID of the private dataset repo (e.g., "username/my-dataset").
+ # If these are not set, saving to the HF Hub will be skipped.
+
+ # --- Start of Local Mode Implementation ---
+ IS_LOCAL_MODE = os.environ.get("GRADIO_LOCAL_MODE", "false").lower() in ["true", "1"]
+
+ if IS_LOCAL_MODE:
+     print("Running in LOCAL mode. Hugging Face functionalities are disabled.")
+     HfApi = None
+ else:
+     try:
+         from huggingface_hub import HfApi, hf_hub_download
+         print("Hugging Face libraries found. HF push functionality is available.")
+     except ImportError:
+         print("Hugging Face libraries not found. HF push functionality will be disabled.")
+         HfApi = None
+ # --- End of Local Mode Implementation ---
+
+ # --- Configuration Loading ---
+ def load_config(config_path='config.yaml'):
+     """Loads the UI and criteria configuration from a YAML file."""
+     try:
+         with open(config_path, 'r', encoding='utf-8') as f:
+             config = yaml.safe_load(f)
+         if 'criteria' not in config or not isinstance(config['criteria'], list):
+             raise ValueError("Config must contain a list of 'criteria'.")
+         return config
+     except FileNotFoundError:
+         return None
+     except Exception as e:
+         print(f"ERROR: Could not parse {config_path}: {e}")
+         return None
+
+ def find_config_files():
+     """Finds all .yaml and .yml files in the root directory."""
+     return glob.glob("*.yaml") + glob.glob("*.yml")
+
+ # --- Static & File I/O Functions ---
+ OUTPUT_CSV = "responses.csv"
+ MAX_CRITERIA = 15  # Maximum number of sliders to support
+
+ def list_samples(samples_dir):
+     """Lists audio files from a specified directory."""
+     if not os.path.isdir(samples_dir):
+         print(f"WARNING: Samples directory '{samples_dir}' not found.")
+         return []
+     files = [f for f in os.listdir(samples_dir) if f.lower().endswith(('.wav', '.mp3', '.ogg', '.flac'))]
+     files.sort()
+     return files
+
+ def save_responses_to_hf(rows, repo_id: str | None = None, token: str | None = None):
+     """
+     Append new rows to a CSV file in a private Hugging Face dataset.
+
+     - Reads the existing CSV (if present).
+     - Appends the new rows.
+     - Uploads the updated file back to the repo.
+
+     Each 'row' should be a dict with consistent keys.
+
+     NOTE:
+     - Replaces the entire CSV on each update (no true append on the server side).
+     - Use for small/medium datasets; large ones should use the `datasets` library instead.
+     """
+     if HfApi is None:
+         return {"status": "hf_unavailable", "reason": "missing_packages"}
+
+     token = token or os.environ.get("HF_TOKEN")
+     repo_id = repo_id or os.environ.get("HF_DATASET_ID")
+     if not token or not repo_id:
+         return {"status": "hf_skipped", "reason": "missing_token_or_repo_env"}
+
+     api = HfApi(token=token)
+     path_in_repo = "data/responses.csv"  # fixed CSV location in the repo
+     repo_err = None
+
+     # Ensure the dataset repo exists
+     try:
+         api.create_repo(repo_id=repo_id, repo_type="dataset", private=True, exist_ok=True)
+     except Exception as e:
+         repo_err = str(e)
+
+     # Try downloading the existing CSV
+     existing_df = pd.DataFrame()
+     try:
+         local_path = hf_hub_download(
+             repo_id=repo_id,
+             filename=path_in_repo,
+             repo_type="dataset",
+             token=token,
+         )
+         existing_df = pd.read_csv(local_path)
+     except Exception as e:
+         # File doesn't exist or is unreadable — start fresh
+         print("file", path_in_repo, "couldn't be found / read:", str(e))
+
+     # Convert the new rows to a DataFrame and append
+     new_df = pd.DataFrame(rows)
+     combined_df = pd.concat([existing_df, new_df], ignore_index=True)
+
+     # Serialize to CSV in memory
+     csv_buffer = io.StringIO()
+     combined_df.to_csv(csv_buffer, index=False)
+     csv_bytes = csv_buffer.getvalue().encode("utf-8")
+
+     # Upload the updated CSV
+     try:
+         api.upload_file(
+             path_or_fileobj=csv_bytes,
+             path_in_repo=path_in_repo,
+             repo_id=repo_id,
+             repo_type="dataset",
+         )
+     except Exception as e:
+         print(str(e))
+         return {"status": "hf_push_error", "error": str(e), "repo_error": repo_err}
+
+     return {"status": "hf_pushed", "rows_added": len(rows), "repo": repo_id, "repo_error": repo_err}
+
+ def _save_responses_to_hf(rows, repo_id: str | None = None, token: str | None = None):
+     """
+     Push a list of dict rows to a private HF dataset, one JSON file per row.
+
+     NOTE: This approach saves each response as an individual file. While this
+     prevents data loss from overwriting a single file, be aware of the following:
+     - Performance: Uploading many small files can be slower than a single large one.
+     - Scalability: A very large number of files (e.g., millions) can make the
+       dataset repository unwieldy to browse or clone.
+     - Loading Data: To load this data back into a `datasets.Dataset` object, you
+       will need to point to the specific files, for example:
+       `load_dataset('json', data_files='path/to/your/repo/data/*.json')`
+     """
+     if HfApi is None:
+         return {"status": "hf_unavailable", "reason": "missing_packages"}
+
+     token = token or os.environ.get("HF_TOKEN")
+     repo_id = repo_id or os.environ.get("HF_DATASET_ID")
+     if not token or not repo_id:
+         return {"status": "hf_skipped", "reason": "missing_token_or_repo_env"}
+
+     api = HfApi(token=token)
+     repo_err = None
+     try:
+         api.create_repo(repo_id=repo_id, repo_type="dataset", private=True, exist_ok=True)
+     except Exception as e:
+         repo_err = str(e)
+
+     # Process each row, uploading it as a separate JSON file
+     num_pushed = 0
+     errors = []
+     for row_dict in rows:
+         try:
+             # Create a unique filename. Using a UUID is the most robust method.
+             filename = f"{uuid.uuid4()}.json"
+             # Place files in a 'data' subdirectory to keep the repo root clean.
+             path_in_repo = f"data/{filename}"
+
+             # Convert the dictionary to JSON bytes for uploading
+             json_bytes = json.dumps(row_dict, indent=2).encode("utf-8")
+
+             api.upload_file(
+                 path_or_fileobj=json_bytes,
+                 path_in_repo=path_in_repo,
+                 repo_id=repo_id,
+                 repo_type="dataset",
+             )
+             num_pushed += 1
+         except Exception as e:
+             errors.append(str(e))
+
+     if errors:
+         print("json errors", errors, "repo errors", repo_err)
+         return {"status": "hf_push_error", "pushed": num_pushed, "total": len(rows), "errors": errors, "repo_error": repo_err}
+
+     return {"status": "hf_pushed", "rows": len(rows), "repo": repo_id, "repo_error": repo_err}
+
+
+ def save_response(sample, audio_path, annotator, session_id, user_email, comment, scores, config):
+     """Saves a response row locally and attempts to push it to the Hugging Face Hub."""
+     os.makedirs(os.path.dirname(OUTPUT_CSV) or '.', exist_ok=True)
+
+     criteria_labels = [c['label'] for c in config['criteria']]
+     header = ["timestamp", "sample", "audio_path", "annotator", "session_id", "user_email"] + criteria_labels + ["comment"]
+
+     active_scores = list(scores)[:len(criteria_labels)]
+     row = [datetime.utcnow().isoformat(), sample, audio_path, annotator, session_id, user_email] + active_scores + [comment]
+
+     write_header = not os.path.exists(OUTPUT_CSV)
+     with open(OUTPUT_CSV, "a", newline='', encoding='utf-8') as f:
+         try:
+             fcntl.flock(f.fileno(), fcntl.LOCK_EX)
+         except Exception:
+             pass
+
+         writer = csv.writer(f)
+         if write_header:
+             writer.writerow(header)
+         writer.writerow(row)
+
+         try:
+             fcntl.flock(f.fileno(), fcntl.LOCK_UN)
+         except Exception:
+             pass
+
+     # --- Hugging Face Push Logic ---
+     hf_result = None
+     if not IS_LOCAL_MODE:
+         try:
+             hf_record = dict(zip(header, row))
+             hf_result = save_responses_to_hf([hf_record])
+         except Exception as e:
+             print(e)
+             hf_result = {"status": "hf_error", "error": str(e)}
+
+     return {"status": "saved", "sample": sample, "hf": hf_result}
+
+
+ # --- Gradio UI Definition ---
+ def make_ui():
+
+     def make_explainer_fn(criterion_index):
+         def explainer(value, config):
+             if not config or criterion_index >= len(config.get('criteria', [])):
+                 return ""
+             criterion = config['criteria'][criterion_index]
+             try:
+                 iv = int(value)
+             except (ValueError, TypeError):
+                 iv = value
+             text = criterion['explanations'].get(iv, "No description for this score.")
+             return f"**{criterion['label']} ({iv}/{criterion['max']}):** {text}"
+         return explainer
+
+     with gr.Blocks() as demo:
+         # --- STATE MANAGEMENT ---
+         samples_list = gr.State()
+         current_index = gr.State(0)
+         config_state = gr.State()
+         session_id_global = gr.State()
+
+         # --- SETUP UI (visible at start) ---
+         with gr.Group() as setup_group:
+             gr.Markdown("# Evaluation Setup")
+             gr.Markdown("Please provide your details and select the evaluation setup to begin.")
+             config_dropdown = gr.Dropdown(choices=find_config_files(), label="Select Evaluation", value=None)
+
+             instructions_md = gr.Markdown(visible=False, elem_classes="instructions")
+
+             with gr.Accordion("Annotator Info", open=True):
+                 annotator_global = gr.Textbox(label="Annotator ID", lines=1)
+                 user_email_global = gr.Textbox(label="User email (optional)", lines=1)
+             start_button = gr.Button("Start Evaluation", variant="primary")
+             config_error_md = gr.Markdown("", visible=False)
+
+         # --- MAIN EVALUATION UI (initially hidden) ---
+         with gr.Group(visible=False) as main_group:
+             title_md = gr.Markdown("# Evaluation UI")
+             header_md = gr.Markdown("")
+             progress_md = gr.Markdown("Sample 1 of X")
+
+             with gr.Row():
+                 with gr.Column(scale=1, variant='panel'):
+                     sample_name_md = gr.Markdown("### Audio File")
+                     gr.Markdown("---")
+                     evaluation_audio = gr.Audio(label="Audio for Evaluation")
+                     gr.Markdown("---")
+                     submit_btn = gr.Button("Save & Next", variant="primary", interactive=False)
+                     status = gr.Textbox(label="Status", interactive=False)
+
+                 with gr.Column(scale=2, variant='panel'):
+                     gr.Markdown("### Scoring Criteria")
+                     slider_explanation_md = gr.Markdown("_Move a slider to see the description for each score._")
+                     gr.Markdown("---")
+                     sliders = [gr.Slider(visible=False, interactive=True) for _ in range(MAX_CRITERIA)]
+                     gr.Markdown("---")
+                     comment = gr.Textbox(label="Comments (optional)", lines=4, placeholder="Enter any additional feedback here...")
+
+         # --- UI ELEMENT LISTS ---
+         main_ui_elements = [
+             title_md, header_md, progress_md, sample_name_md, evaluation_audio,
+             slider_explanation_md, comment, submit_btn, status, *sliders
+         ]
+
+         # --- LOGIC & EVENTS ---
+         def load_sample(samples, index, config):
+             total_samples = len(samples)
+             updates = {}
+             if index >= total_samples:
+                 completion_msg = f"**All {total_samples} samples completed! Thank you!**"
+                 for el in main_ui_elements:
+                     updates[el] = gr.update(visible=False)
+                 updates[progress_md] = gr.update(value=completion_msg, visible=True)
+                 updates[status] = gr.update(value="Finished.", visible=True)
+                 return updates
+
+             sample = samples[index]
+             samples_dir = config.get('samples_directory', 'sample-audios')
+             sample_path = os.path.join(samples_dir, sample)
+             sample_exists = os.path.exists(sample_path)
+
+             updates = {
+                 progress_md: gr.update(value=f"Sample **{index + 1}** of **{total_samples}**", visible=True),
+                 sample_name_md: gr.update(value=f"### File: `{sample}`", visible=True),
+                 evaluation_audio: gr.update(value=sample_path if sample_exists else None, visible=sample_exists),
+                 slider_explanation_md: gr.update(value="_Move a slider to see the description for each score._", visible=True),
+                 comment: gr.update(value="", visible=True),
+                 submit_btn: gr.update(value="Play audio to enable", interactive=False, visible=True),
+                 status: gr.update(value="Ready.", visible=True)
+             }
+             num_criteria = len(config['criteria'])
+             for i in range(MAX_CRITERIA):
+                 if i < num_criteria:
+                     criterion = config['criteria'][i]
+                     updates[sliders[i]] = gr.update(
+                         label=criterion['label'], minimum=criterion['min'], maximum=criterion['max'],
+                         step=criterion['step'], value=criterion['default'], visible=True
+                     )
+                 else:
+                     updates[sliders[i]] = gr.update(visible=False, value=0)
+             return updates
+
+         def enable_submit_button():
+             return gr.update(value="Save & Next", interactive=True)
+
+         def update_instructions(config_path):
+             if not config_path:
+                 return gr.update(value="", visible=False)
+             config = load_config(config_path)
+             if config and 'instructions_markdown' in config:
+                 return gr.update(value=config['instructions_markdown'], visible=True)
+             return gr.update(value="", visible=False)
+
+         def start_session(config_path):
+             if not config_path or not os.path.exists(config_path):
+                 return {config_error_md: gr.update(value="**Error:** Please select a valid configuration file.", visible=True)}
+
+             config = load_config(config_path)
+             if config is None:
+                 return {config_error_md: gr.update(value=f"**Error:** Could not load or parse `{config_path}`. Check the console for details.", visible=True)}
+
+             samples_dir = config.get('samples_directory', 'sample-audios')
+             should_randomize = config.get('randomize_samples', False)
+
+             s_list = list_samples(samples_dir)
+             if not s_list:
+                 return {config_error_md: gr.update(value=f"**Error:** No audio files found in directory: `{samples_dir}`", visible=True)}
+
+             if should_randomize:
+                 random.shuffle(s_list)
+
+             session_id = str(uuid.uuid4())
+             index = 0
+
+             updates = {
+                 setup_group: gr.update(visible=False),
+                 main_group: gr.update(visible=True),
+                 config_error_md: gr.update(visible=False),
+                 title_md: gr.update(value=f"# {config.get('title', 'Evaluation UI')}"),
+                 header_md: gr.update(value=config.get('header_markdown', '')),
+                 config_state: config,
+                 session_id_global: session_id,
+                 samples_list: s_list,
+                 current_index: index,
+             }
+             sample_updates = load_sample(s_list, index, config)
+             updates.update(sample_updates)
+             return updates
+
+         def save_and_next(index, samples, annotator, sid, email, comment, config, *scores):
+             sample = samples[index]
+             samples_dir = config.get('samples_directory', 'sample-audios')
+             sample_path = os.path.join(samples_dir, sample)
+             save_status = save_response(sample, sample_path, annotator, sid, email, comment, scores, config)
+
+             next_index = index + 1
+             updates_dict = load_sample(samples, next_index, config)
+             # Provide a more detailed status, including HF info if available
+             status_message = f"Saved {sample} locally."
+             if save_status.get('hf'):
+                 hf_stat = save_status['hf'].get('status', 'hf_unknown')
+                 status_message += f" HF status: {hf_stat}."
+             updates_dict[status] = gr.update(value=status_message)
+
+             ordered_updates = [updates_dict.get(el) for el in main_ui_elements]
+             return [next_index] + ordered_updates
+
+         # --- Event Wiring ---
+         config_dropdown.change(
+             update_instructions, inputs=[config_dropdown], outputs=[instructions_md]
+         ).then(None, None, None, js="() => { document.getElementById('component-0').scrollIntoView(); }")
+
+         start_button.click(
+             start_session,
+             inputs=[config_dropdown],
+             outputs=[
+                 setup_group, main_group, config_error_md, *main_ui_elements,
+                 config_state, session_id_global, samples_list, current_index
+             ]
+         )
+
+         submit_btn.click(
+             save_and_next,
+             inputs=[current_index, samples_list, annotator_global, session_id_global, user_email_global, comment, config_state, *sliders],
+             outputs=[current_index, *main_ui_elements]
+         )
+
+         for i, slider in enumerate(sliders):
+             slider.change(make_explainer_fn(i), inputs=[slider, config_state], outputs=[slider_explanation_md])
+
+         evaluation_audio.play(fn=enable_submit_button, inputs=None, outputs=[submit_btn])
+
+         demo.load(update_instructions, inputs=config_dropdown, outputs=instructions_md)
+
+     return demo
+
+ if __name__ == "__main__":
+     app = make_ui()
+     app.launch(server_name="0.0.0.0", server_port=7860)
config_mos.yaml ADDED
@@ -0,0 +1,37 @@
+ # Configuration for a standard Mean Opinion Score (MOS) test.
+ title: "MOS Test - Audio Quality Evaluation"
+ header_markdown: "Listen to the audio sample and rate its overall quality on a scale of 1 to 5."
+
+ instructions_markdown: |
+   **Welcome, annotator!**
+
+   Instructions for the MOS test.
+
+   Please follow these steps carefully:
+   1. Enter your unique **Annotator ID** before you begin.
+   2. Listen to each audio clip from start to finish.
+   3. Rate the clip using the sliders provided, based on the scoring guide.
+   4. Provide any extra details in the comments box.
+   5. Click 'Save & Next' to submit your rating and load the next clip.
+
+ # The directory where your audio files are stored.
+ samples_directory: "sample-audios"
+
+ # Set to 'true' to shuffle the audio files, 'false' for alphabetical order.
+ randomize_samples: true
+
+ # MOS tests typically use a single criterion for overall quality.
+ criteria:
+   - label: "Overall Quality"
+     min: 1
+     max: 5
+     step: 1
+     default: 3
+     # Standard definitions for the 5-point Absolute Category Rating (ACR) scale.
+     explanations:
+       1: "Bad - The quality is very distracting and unpleasant."
+       2: "Poor - The quality is distracting and annoying."
+       3: "Fair - The quality is slightly distracting, but acceptable."
+       4: "Good - The quality is not distracting; it is fine."
+       5: "Excellent - The quality is flawless and natural."
config_original.yaml ADDED
@@ -0,0 +1,93 @@
+ # General UI Configuration
+ title: "TTS Rubric — Dynamic Evaluation"
+
+ instructions_markdown: |
+   **Welcome, annotator!**
+
+   Instructions for the multiple-aspect test.
+
+   Please follow these steps carefully:
+   1. Enter your unique **Annotator ID** before you begin.
+   2. Listen to each audio clip from start to finish.
+   3. Rate the clip using the sliders provided, based on the scoring guide.
+   4. Provide any extra details in the comments box.
+   5. Click 'Save & Next' to submit your rating and load the next clip.
+
+ # The directory where your audio files are stored.
+ samples_directory: "sample-audios"
+
+ # Set to 'true' to shuffle the audio files, 'false' for alphabetical order.
+ randomize_samples: true
+
+ # Define the evaluation criteria. The UI is built from this list.
+ criteria:
+   - label: "Clarity & Intelligibility"
+     min: 1
+     max: 5
+     step: 1
+     default: 3
+     explanations:
+       1: "Unacceptable."
+       2: "Often unclear or distorted; difficult to follow."
+       3: "Understandable but requires effort; some words unclear."
+       4: "Mostly clear, minor issues (with fast/slow playback)."
+       5: "Speech is clear and easy to understand (at all speeds)."
+
+   - label: "Accent & Pronunciation"
+     min: 1
+     max: 5
+     step: 1
+     default: 3
+     explanations:
+       1: "Severe pronunciation problems; largely unintelligible."
+       2: "Frequent pronunciation issues that impede understanding."
+       3: "Some mispronunciations that require effort to interpret."
+       4: "Minor pronunciation quirks but overall fine."
+       5: "Pronunciation is natural and appropriate for the target dialect."
+
+   - label: "Tone & Suitability"
+     min: 1
+     max: 5
+     step: 1
+     default: 3
+     explanations:
+       1: "Tone is inappropriate or harmful for the content."
+       2: "Tone often feels off or distracting from the content."
+       3: "Tone is acceptable but occasionally inappropriate."
+       4: "Generally appropriate tone with small mismatches."
+       5: "Tone fits the content and use-case perfectly."
+
+   - label: "Voice quality"
+     min: 1
+     max: 5
+     step: 1
+     default: 3
+     explanations:
+       1: "Unusable voice quality."
+       2: "Poor quality with frequent artifacts."
+       3: "Noticeable quality issues but still usable."
+       4: "Minor artifacts but overall high quality."
+       5: "Natural, pleasant voice with no artifacts."
+
+   - label: "Customization & Flexibility"
+     min: 1
+     max: 5
+     step: 1
+     default: 3
+     explanations:
+       1: "No useful customization; inflexible."
+       2: "Very limited or brittle customization options."
+       3: "Limited customization; acceptable for simple use-cases."
+       4: "Some customization available; works well for most cases."
+       5: "Highly flexible and customizable for different styles."
+
+   - label: "Listening comfort"
+     min: 1
+     max: 5
+     step: 1
+     default: 3
+     explanations:
+       1: "Uncomfortable or painful to listen to."
+       2: "Often fatiguing or distracting to listen to."
+       3: "Some listening fatigue; tolerable for short durations."
+       4: "Mostly comfortable with occasional sharpness or fatigue."
+       5: "Comfortable to listen to for extended periods."
download_dataset.py ADDED
@@ -0,0 +1,94 @@
+ """Download and merge all data files from a Hugging Face dataset repo.
+
+ Usage:
+     HF_TOKEN must be exported in your environment (or passed via --token).
+     HF_DATASET_ID may be exported or passed via --repo.
+
+ Example:
+     export HF_TOKEN="hf_..."
+     python download_dataset.py --repo kathiasi/tts-rubric-responses --outdir out
+
+ This script downloads any files under `data/` (parquet or arrow/ipc), reads them,
+ concatenates them into a single table, and writes `combined.parquet` and
+ `combined.csv` to `outdir`.
+ """
+ import os
+ import argparse
+ import json
+
+ from huggingface_hub import HfApi, hf_hub_download
+ import pyarrow.parquet as pq
+ import pyarrow.ipc as ipc
+ import pandas as pd
+
+
+ def read_parquet(path):
+     try:
+         tbl = pq.read_table(path)
+         return tbl.to_pandas()
+     except Exception as e:
+         raise RuntimeError(f"Failed to read parquet {path}: {e}")
+
+
+ def read_arrow(path):
+     try:
+         with open(path, 'rb') as f:
+             reader = ipc.open_file(f)
+             tbl = reader.read_all()
+         return tbl.to_pandas()
+     except Exception as e:
+         raise RuntimeError(f"Failed to read arrow/ipc {path}: {e}")
+
+
+ def download_and_merge(repo_id, outdir, token=None):
+     api = HfApi()
+     token = token or os.environ.get('HF_TOKEN')
+     if not token:
+         raise RuntimeError('HF_TOKEN not provided; export HF_TOKEN or pass --token')
+
+     files = api.list_repo_files(repo_id=repo_id, repo_type='dataset', token=token)
+     data_files = [f for f in files if f.startswith('data/')]
+     if not data_files:
+         print('No data/ files found in dataset repo. Files found:')
+         print(json.dumps(files, indent=2))
+         return
+
+     os.makedirs(outdir, exist_ok=True)
+     dfs = []
+     for fname in sorted(data_files):
+         print('Processing', fname)
+         local_path = hf_hub_download(repo_id=repo_id, repo_type='dataset', filename=fname, token=token)
+         if fname.endswith('.parquet'):
+             df = read_parquet(local_path)
+         elif fname.endswith('.arrow') or fname.endswith('.ipc'):
+             df = read_arrow(local_path)
+         else:
+             print('Skipping unsupported data file:', fname)
+             continue
+         dfs.append(df)
+
+     if not dfs:
+         print('No supported data files were read.')
+         return
+
+     combined = pd.concat(dfs, ignore_index=True)
+     out_parquet = os.path.join(outdir, 'combined.parquet')
+     out_csv = os.path.join(outdir, 'combined.csv')
+     print(f'Writing {len(combined)} rows to', out_parquet)
+     combined.to_parquet(out_parquet, index=False)
+     print('Also writing CSV to', out_csv)
+     combined.to_csv(out_csv, index=False)
+     print('Done.')
+
+
+ if __name__ == '__main__':
+     p = argparse.ArgumentParser()
+     p.add_argument('--repo', help='Dataset repo id (user/name)', default=os.environ.get('HF_DATASET_ID'))
+     p.add_argument('--outdir', help='Output directory', default='hf_dataset')
+     p.add_argument('--token', help='Hugging Face token (optional)', default=None)
+     args = p.parse_args()
+
+     if not args.repo:
+         print('Dataset repo id is required via --repo or HF_DATASET_ID env var')
+         raise SystemExit(1)
+
+     download_and_merge(args.repo, args.outdir, token=args.token)
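
Note that the app's push helpers write `data/responses.csv` or per-row `data/*.json` files, which this script's parquet/arrow branch skips. A small extension to merge those per-row JSON files could look like this (a sketch, assuming the flat one-dict-per-file layout that `_save_responses_to_hf` in app.py produces):

```python
import glob
import json
import os

def merge_json_rows(data_dir):
    """Merge per-row JSON files (one flat dict each) into a list of records.

    Mirrors the per-row layout written by app.py's _save_responses_to_hf;
    the resulting list can be handed to pandas.DataFrame(records).
    """
    records = []
    for path in sorted(glob.glob(os.path.join(data_dir, "*.json"))):
        with open(path, "r", encoding="utf-8") as f:
            records.append(json.load(f))
    return records
```

The returned records could then be concatenated with the parquet/arrow DataFrames before writing `combined.parquet`.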
requirements.txt ADDED
@@ -0,0 +1,8 @@
+ gradio==5.15
+ numpy
+ python-docx
+ PyYAML
+ huggingface_hub>=0.28.1
+ pandas
sample-audios/sorsamisk_-_115_01_-_Goevten_voestes_biejjie_015_024_s.wav ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d08e73cfdf0d3c6aaf1eef5b4a028b96f6dac88a0f673ed654faabdabb0b82cd
+ size 827436
sample-audios/sorsamisk_-_7_01_-_Jarkoestidh_028_020.wav ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:3292ee4d5f741001bff3a8ac7e2c5d18daee9b227f7a36cf43632205b6983b91
+ size 209532
sample-audios/sorsamisk_-_III_01_-_Giesie_eejehtimmiebiejjieh_015_015.wav ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:98c6fc87df1f69d24878c4f04fce7f7713c574a1bd531001a65f66b096e0ad40
+ size 295016
sample-audios/sorsamisk_goltelidh_jupmelen_rihjke_lea_gietskesne_cd1_mono_022_009.wav ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f3bbcaea90f2bdb4bf798c8e4214ddf2381eedaf3ffd974509ca54d02dc5504b
+ size 468702
sample-audios/sorsamisk_goltelidh_jupmelen_rihjke_lea_gietskesne_cd1_mono_030_014.wav ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:90bc9504d119bc8a95ab02c16299c054725c7b32fa55e9b5b2dbe847fb297d83
+ size 458130
sample-audios/sorsamisk_goltelidh_jupmelen_rihjke_lea_gietskesne_cd2_009_TRN_019.wav ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:71ccd5595e3ed565eba27cd1ce48a07cd954717c196a9ea3a672bbe9cf043290
+ size 147504
sample-audios/sorsamisk_goltelidh_jupmelen_rihjke_lea_gietskesne_cd2_009_TRN_019_s.wav ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:86554d671d506fef1863a3b026ccbfb77e6fd6e2112b4380b9ccab18acd3f9a1
+ size 143916