Skip to content

codertimo/KorQuAD-Question-Generation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

17 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Question Generation(QG) Model with KorQuAD

ํ•™์Šต๋œ SKT-AI/KoGPT2 ๋ชจ๋ธ์„ ๊ธฐ๋ฐ˜์œผ๋กœ ์งˆ๋ฌธ ์ƒ์„ฑ QG(Question Generation) ๋ชจ๋ธ์„ ๋งŒ๋“ค์—ˆ์Šต๋‹ˆ๋‹ค. QG ๋ชจ๋ธ์„ ๋งŒ๋“ค๊ธฐ ์œ„ํ•ด Question Answering ๋ฐ์ดํ„ฐ์…‹์ธ KorQuAD v1.0์„ ์‚ฌ์šฉํ•˜์˜€์Šต๋‹ˆ๋‹ค.

์‚ฌ์šฉ ๋ฐฉ๋ฒ•

๋ฐ์ดํ„ฐ ์ค€๋น„

ํ•™์Šต/ํ‰๊ฐ€/์ƒ์„ฑ์„ ์œ„ํ•ด์„œ KorQuAD v1.0 ๋ฐ์ดํ„ฐ์…‹์„ ๋‹ค์šด ๋ฐ›์Šต๋‹ˆ๋‹ค.

make prepare-dataset

ํ•™์Šต

๋‹ค์Œ ์ปค๋งจ๋“œ๋ฅผ ์ด์šฉํ•ด์„œ ํ•™์Šต์„ ์ˆ˜ํ–‰ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

python -m scripts.run_fine_tune --train-batch-size 16 --eval-batch-size 16 --epochs 5

์„ฑ๋Šฅ ํ‰๊ฐ€ (dev ์…‹ PPL ์ธก์ •)

MODEL_PATH = "artifacts/gpt2_xxxxxxxx/gpt2_step_x.pth"
python -m scripts.run_evaluation --model-path $MODEL_PATH --batch-size 50

์งˆ๋ฌธ ์ƒ์„ฑ (dev ์…‹์— ๋Œ€ํ•ด์„œ ์งˆ๋ฌธ ์ƒ์„ฑ)

Decoding ๊ฒฐ๊ณผ

Question Generation POC ์Šคํ”„๋ ˆ๋“œ ์‹œํŠธ: KorQuAD v1.0 dev ์…‹์— ๋Œ€ํ•ด์„œ decoding ํ•œ ๊ฒฐ๊ณผ ์ž…๋‹ˆ๋‹ค.

beam-search ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ decoding ๋˜์—ˆ์œผ๋ฉฐ, beam_size ๋Š” 5๋ฅผ ์‚ฌ์šฉํ•˜์˜€์Šต๋‹ˆ๋‹ค.

MODEL_PATH = "artifacts/gpt2_xxxxxxxx/gpt2_step_x.pth"
python -m scripts.run_generate --model-path $MODEL_PATH --output-path decoded.tsv

ํ•™์Šต๋œ QG ๋ชจ๋ธ ๋‹ค์šด๋กœ๋“œ

Author

by Junseong Kim (Scatter Lab, Pingpong AI) codertimo@gmail.com

About

question generation model with KorQuAD dataset

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published