File size: 882 Bytes
d981182
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
440e679
 
cc2ba52
 
 
 
440e679
 
d981182
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
---
language: [en, ko]
license: unknown
tags:
- roberta
- sequence-classification
- code
- small
inference: false
library_name: transformers
pipeline_tag: text-classification
datasets:
- dacon
---

# code-sim-roberta-small

RoBERTa-small์„ ์ฝ”๋“œ ์œ ์‚ฌ๋„ ๋ถ„๋ฅ˜ ํƒœ์Šคํฌ๋กœ ํŒŒ์ธํŠœ๋‹ํ•œ ๊ฐ€์ค‘์น˜์ž…๋‹ˆ๋‹ค.

Task : https://dacon.io/competitions/official/235900/overview/description

Decription : ๋‘ ์ฝ”๋“œ๊ฐ„ ์œ ์‚ฌ์„ฑ(๋™์ผ ๊ฒฐ๊ณผ๋ฌผ ์‚ฐ์ถœ ๊ฐ€๋Šฅํ•œ์ง€) ์—ฌ๋ถ€๋ฅผ ํŒ๋‹จํ•  ์ˆ˜ ์žˆ๋Š” AI ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ๊ฐœ๋ฐœ

์‚ฌ์šฉ pretrained_model : "hosung1/roberta_small_mlm_from_scratch"

์‚ฌ์šฉ Datasets : Dacon์ œ๊ณต

## How to use
```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tok = AutoTokenizer.from_pretrained("hosung1/code-sim-roberta-small")
mdl = AutoModelForSequenceClassification.from_pretrained("hosung1/code-sim-roberta-small")