summarization / README.md

Add evaluation results on the autoevaluate--xsum-sample config and test split of autoevaluate/xsum-sample

fbc0b73 about 3 years ago

4.09 kB

	---
	license: apache-2.0
	tags:
	- generated_from_trainer
	- summarization
	datasets:
	- xsum
	- autoevaluate/xsum-sample
	metrics:
	- rouge
	model-index:
	- name: summarization
	results:
	- task:
	type: text2text-generation
	name: Sequence-to-sequence Language Modeling
	dataset:
	name: xsum
	type: xsum
	args: default
	metrics:
	- type: rouge
	value: 23.9405
	name: Rouge1
	- task:
	type: summarization
	name: Summarization
	dataset:
	name: autoevaluate/xsum-sample
	type: autoevaluate/xsum-sample
	config: autoevaluate--xsum-sample
	split: test
	metrics:
	- type: rouge
	value: 18.3598
	name: ROUGE-1
	verified: true
	verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMGJiZThmYzcwMDU4MjI4MjZlNTBjNDQ5MjYxZDNiNzU3Y2Y5OWMxZmZjODAwYTI0YTdkZTZmZTVjMmI3MGY0MSIsInZlcnNpb24iOjF9.KscbYOZebwlZfpNk-X0c54yQc7T4sXDa-ICk3WMLBsDFYAT4RGeSOa7YZTbBKaQ9ebMgl9adQ0PPV4u6t3vNBQ
	- type: rouge
	value: 3.0796
	name: ROUGE-2
	verified: true
	verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZWY2NjViZWYxODYyYWQyYzliYjVlY2NiYzU2YmYyNzQ5MTE5NTE0MjhhMTU3NDk0YjVhYjRmOGM3YmNjOGE1NyIsInZlcnNpb24iOjF9.vq-_esb4DMnlJ_lbzSY_AClBMSBnTNuzCKeVkIEc_vxqXrKI7Dz7pYkxnzGXOznXc0gTkfGGp7kOUzYDc3cdDQ
	- type: rouge
	value: 14.9038
	name: ROUGE-L
	verified: true
	verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMGQ1ODA0NzA1N2QzZGQ4MDgyMWE0YzQ1ZmQ5YWFhZTEzYmZhMjUwYTYyOWU5MzdjZGUxNThkZmQ4OGY0MmVkNiIsInZlcnNpb24iOjF9.BFaFzEuNfDuwqpiFjb8HY6uRQdgzg41plZuU8eEeBjHJSF1QvNwA6oWvUCSToT-LqftjYuoy_-jgNsFd-sziCA
	- type: rouge
	value: 14.8069
	name: ROUGE-LSUM
	verified: true
	verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOTE0MmU4MDAzZjI4ZDFiZTEzZWQwMGMzZGZmZTM1ZmM3ZTM4ZWE1OWFmZTI3NDg3OTA0ZDRlNzU5YzQ0NWI1ZSIsInZlcnNpb24iOjF9.aMSrcZ0bh_H66JSnClCOYiozFPUSa9xxKn_4xjqRGsjNX9nv6ELVeNpPJNO4w8gbZxT8RkeZJx99_t7F_7u6BA
	- type: loss
	value: 3.009582281112671
	name: loss
	verified: true
	verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNjJkMmRiNmMzYTAzMWVhNTJjYmM2ZjRkZDg1M2FjYTdiNmUyY2RmYjIwYjdlODQ3OTY3YjI0ZWUwNWFjNWEyZCIsInZlcnNpb24iOjF9.3z4IZp7P5WPZ3lFyjTcHVMZy2eKhlh8sp6zZno8XstvFQqt7vcSljfx1sH_9GcC8xtNL0b83r2qZKL8Zc_8gCQ
	- type: gen_len
	value: 18.05
	name: gen_len
	verified: true
	verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMDgzZDBjMzZmMmQ5Zjc3NDYwYzhmNGY2ZDA1ZDlkMGI5OTU2N2RkYjUzZGNiM2YwYTU4MDhiZTQxNTkxZDIyNyIsInZlcnNpb24iOjF9.FIorCL9Gpp2MoftgKvST5bj_WTjDP7KkxclK1JOiN9dTyzQDsaG1wIoUewm4NV9BMXTDJkFORi39DypL9NRZBw
	---

	<!-- This model card has been generated automatically according to the information the Trainer had access to. You
	should probably proofread and complete it, then remove this comment. -->

	# summarization

	This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the xsum dataset.
	It achieves the following results on the evaluation set:
	- Loss: 2.6690
	- Rouge1: 23.9405
	- Rouge2: 5.0879
	- Rougel: 18.4981
	- Rougelsum: 18.5032
	- Gen Len: 18.7376

	## Model description

	More information needed

	## Intended uses & limitations

	More information needed

	## Training and evaluation data

	More information needed

	## Training procedure

	### Training hyperparameters

	The following hyperparameters were used during training:
	- learning_rate: 2e-05
	- train_batch_size: 16
	- eval_batch_size: 16
	- seed: 42
	- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
	- lr_scheduler_type: linear
	- training_steps: 1000
	- mixed_precision_training: Native AMP

	### Training results

	\| Training Loss \| Epoch \| Step \| Validation Loss \| Rouge1 \| Rouge2 \| Rougel \| Rougelsum \| Gen Len \|
	\|:-------------:\|:-----:\|:----:\|:---------------:\|:-------:\|:------:\|:-------:\|:---------:\|:-------:\|
	\| 2.9249 \| 0.08 \| 1000 \| 2.6690 \| 23.9405 \| 5.0879 \| 18.4981 \| 18.5032 \| 18.7376 \|


	### Framework versions

	- Transformers 4.19.2
	- Pytorch 1.11.0+cu113
	- Datasets 2.2.2
	- Tokenizers 0.12.1