1 2 14

Lei Hsiung

hsiung

https://hsiung.cc/

AI & ML interests

Trustworthy ML

Recent Activity

authored a paper 4 days ago

Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets

authored a paper 4 days ago

Spectral Insights into Data-Oblivious Critical Layers in Large Language Models

authored a paper 4 days ago

NCTV: Neural Clamping Toolkit and Visualization for Neural Network Calibration

View all activity

Organizations

authored 3 papers 4 days ago

Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets

Paper • 2506.05346 • Published Jun 5, 2025

Spectral Insights into Data-Oblivious Critical Layers in Large Language Models

Paper • 2506.00382 • Published May 31, 2025

NCTV: Neural Clamping Toolkit and Visualization for Neural Network Calibration

Paper • 2211.16274 • Published Nov 29, 2022

updated a Space 6 days ago

NCTV: Neural Clamping Toolkit and Visualization

🦀

Model-agnostic Toolkit for Neural Network Calibration

liked 3 datasets 8 days ago

liked 2 models 3 months ago

Salesforce/GTA1-7B-2507

Image-Text-to-Text • 8B • Updated Oct 3, 2025 • 461 • 3

mPLUG/GUI-Owl-7B

8B • Updated Aug 22, 2025 • 626 • 51

updated a dataset 3 months ago

hsiung/ultrachat_beavertails

Updated Oct 10, 2025 • 1

liked a model 3 months ago

Qwen/Qwen3-4B-Instruct-2507

Text Generation • 4B • Updated Sep 17, 2025 • 2.39M • • 640

liked 2 datasets 6 months ago

selfrag/selfrag_train_data

Viewer • Updated Oct 31, 2023 • 146k • 131 • 74

HuggingFaceFW/fineweb

Viewer • Updated Jul 11, 2025 • 52.5B • 190k • 2.61k

liked a dataset 8 months ago

AmazonScience/FalseReject

Viewer • Updated May 14, 2025 • 15.8k • 346 • 30

updated a dataset 11 months ago

hsiung/beavertails_chat

Viewer • Updated Mar 4, 2025 • 19.3k • 46

liked 2 datasets 11 months ago

kaist-ai/CoT-Collection

Viewer • Updated Oct 14, 2023 • 1.84M • 1.43k • 154

kaist-ai/Multilingual-CoT-Collection

Updated Oct 14, 2023 • 193 • 26

updated a Space 11 months ago

README

🚀

updated 2 models about 1 year ago

hsiung/samsum_high_sim_5k

Text Generation • 7B • Updated Jan 1, 2025 • 3

hsiung/samsum_low_sim_5k

Text Generation • 7B • Updated Jan 1, 2025 • 3

Lei Hsiung

AI & ML interests

Recent Activity

Organizations

hsiung's activity

NCTV: Neural Clamping Toolkit and Visualization

README