PyTorch implementation of Med-BERT using BERT from Hugging Face Transformers.
BERT is used to obtain embeddings of medical concepts. The pretraining tasks are masked language modeling (MLM) and prediction of a prolonged hospital length of stay (e.g. > 7 days).
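The joint objective can be sketched as follows. This is a minimal illustration, not the repo's actual classes: the name `MedBertPretrain`, the model dimensions, and the choice of a binary head on the first-token representation of `BertForMaskedLM` are all assumptions.

```python
import torch.nn as nn
from transformers import BertConfig, BertForMaskedLM


class MedBertPretrain(nn.Module):
    """Sketch of the two pretraining heads: MLM + prolonged length of stay (LOS)."""

    def __init__(self, vocab_size, hidden_size=192):
        super().__init__()
        config = BertConfig(
            vocab_size=vocab_size,
            hidden_size=hidden_size,
            num_hidden_layers=6,
            num_attention_heads=6,
            intermediate_size=4 * hidden_size,
        )
        self.bert = BertForMaskedLM(config)
        # Binary head on the first-token embedding for prolonged LOS (assumed pooling).
        self.los_head = nn.Linear(hidden_size, 1)

    def forward(self, input_ids, attention_mask, mlm_labels, los_labels):
        out = self.bert(
            input_ids=input_ids,
            attention_mask=attention_mask,
            labels=mlm_labels,
            output_hidden_states=True,
        )
        cls = out.hidden_states[-1][:, 0]  # [CLS]-style sequence representation
        los_logits = self.los_head(cls).squeeze(-1)
        los_loss = nn.functional.binary_cross_entropy_with_logits(
            los_logits, los_labels.float()
        )
        # Joint loss: MLM cross-entropy + prolonged-LOS binary cross-entropy.
        return out.loss + los_loss
```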
Rasmy, Laila, et al. "Med-BERT: pretrained contextualized embeddings on large-scale structured electronic health records for disease prediction." npj Digital Medicine 4.1 (2021): 1-13.
We use example data generated with https://github.com/synthetichealth/synthea.git and formatted using https://github.com/kirilklein/ehr_preprocess.git.
The data can be found in data/raw/synthea{size}.
The main scripts are:
- python main_data_pretrain.py (config: dataset_pretrain.yaml)
- python main_pretrain.py (configs: model.yaml, trainer.yaml)
- python main_finetune.py (config: finetune.yaml)
- python main_perturb.py (config: perturb.yaml)
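For fine-tuning, the pretrained encoder can in principle be reused with a sequence-classification head. The sketch below is hypothetical: the checkpoint path depends on where main_pretrain.py writes its output, and it assumes the checkpoint was saved in Hugging Face's save_pretrained format.

```python
# Hypothetical sketch: reuse the pretrained encoder for disease prediction.
from transformers import BertForSequenceClassification

model = BertForSequenceClassification.from_pretrained(
    "checkpoints/pretrain",  # hypothetical output dir of main_pretrain.py
    num_labels=2,            # binary disease prediction, as in Med-BERT
)
```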