[2022]BERN2:an advanced neural biomedical named entity recognition and normalization tool

September 6, 2023

짧은 요약(Abstract) :   
* BERN2 제안하여 BIO NER, NEN 성능 개선

Paper with my notes

~~Lecture link~~

단어정리

collate: 대조하다, 맞추어 보다, 함께 합치다

1 Introduction

Biomedical 텍스트마이닝 중요(관련 텍스트 지속 증가)
** biomedical entity(gene/protein/disease) 또한 지속 증가
** bio NER, NEN(named entity normalization) 에 사용 가능
기존 것 약점 3가지 있음
** 1. 엔티티 타입 적음(5)
** 2. 모델마다 annotation 다름
** 3. slow
BERN 제안
** 9가지 biomedical entity types
** reduce annotation time
** combine rule-base and neural network based NEN -> improve the quality of NEN
</br>

2 Materials and methods

2.1 Multi-task named entity recognition

NER 여러개 멀티태스크러닝으로 돌려서 빠름
** BC2GM - gene/protein
** NCBI-disease - disease
** BC4CHEMD - drug/chemical
** Linnaeus - species
** JNLPBA - cell line, cell type, DNA, and RNA

2.2 Hybrid named entity normalization

rule + nn