Coreference Resolution

Introduction

This repository contains "Korean Coreference Resolution model".
This model is based on Kenton Lee's English Coreference Resolution model. And we applied it to Korean Coreference Resolution with referencing Shin et al.

Getting Started

Install python3 requirements: pip3 install -r requirements.txt
Build custom kernels by running setup_all.sh.
- There are 3 platform-dependent ways to build custom TensorFlow kernels. Please comment/uncomment the appropriate lines in the script.
Download word2vec.txt and save at here.

Training Instructions

./train_coref.sh preprocesses data before train.
Experiment configurations are found in experiments.conf
Choose an experiment. Change paths of data, word embedding and other parameters which you would like.
Training: python3 train.py <experiment>
Results are stored in the logs directory and can be viewed via TensorBoard.
Evaluation: python3 evaluate.py <experiment>

Pretrained model & ELMo embedding

logs directory have a pretrained model, MTA02-test.
- MTA02-test is a pretrained model of crowdsourcing data set.
If you want to use pretrained ELMo embedding, download it in the input directory.

Others

The training terminates automatically at 30k steps. The model generally converges at about 25k steps.
If there are some errors when evaluating the development set, v4_gold_conll file may have errors. So, you should change the train, dev. set path of verify_conll.py and run it. Then, you may find some errors and fix them.
- Most of these kind of errors are caused by ETRI morphological analysis.

References

Licenses

CC BY-NC-SA Attribution-NonCommercial-ShareAlike
If you want to commercialize this resource, please contact to us

Publisher

Machine Reading Lab @ KAIST

Acknowledgement

This work was supported by Institute for Information & communications Technology Promotion(IITP) grant funded by the Korea government(MSIT) (2013-0-00109, WiseKB: Big data based self-evolving knowledge base and reasoning platform)

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
input		input
logs/MTA02-test		logs/MTA02-test
.gitattributes		.gitattributes
.gitignore		.gitignore
ETRI-coref.py		ETRI-coref.py
README.md		README.md
cache_elmo.py		cache_elmo.py
character_evaluate.py		character_evaluate.py
command.sh		command.sh
conll.py		conll.py
coref_kernels.cc		coref_kernels.cc
coref_model.py		coref_model.py
coref_model.pyc		coref_model.pyc
coref_ops.py		coref_ops.py
coref_ops.pyc		coref_ops.pyc
coreference.sh		coreference.sh
dev_conll.log		dev_conll.log
entity_type_kbox		entity_type_kbox
etri.py		etri.py
etri.pyc		etri.pyc
evaluate.py		evaluate.py
evaluate_result.pickle		evaluate_result.pickle
experiments.conf		experiments.conf
filter_embeddings.py		filter_embeddings.py
get_char_vocab.py		get_char_vocab.py
jisi_determiner_list.txt		jisi_determiner_list.txt
jsonlines_to_json.py		jsonlines_to_json.py
link_character.py		link_character.py
make_conll.log		make_conll.log
make_conll.py		make_conll.py
make_document.py		make_document.py
metrics.py		metrics.py
minimize.py		minimize.py
pre_dump.py		pre_dump.py
predict.py		predict.py
pronoun_detect_2.py		pronoun_detect_2.py
pronoun_lemma_list.txt		pronoun_lemma_list.txt
requirements.txt		requirements.txt
rule_evaluate.py		rule_evaluate.py
setup_all.sh		setup_all.sh
train.py		train.py
train_coref.sh		train_coref.sh
util.py		util.py
util.pyc		util.pyc
util2.py		util2.py
verify_conll.py		verify_conll.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Coreference Resolution

Introduction

Getting Started

Training Instructions

Pretrained model & ELMo embedding

Others

References

Licenses

Publisher

Acknowledgement

About

Uh oh!

Releases

Packages

Languages

machinereading/CR

Folders and files

Latest commit

History

Repository files navigation

Coreference Resolution

Introduction

Getting Started

Training Instructions

Pretrained model & ELMo embedding

Others

References

Licenses

Publisher

Acknowledgement

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages