SurvTRACE: Transformers for Survival Analysis with Competing Events

Last update: Oct 06, 2022


This repo provides the implementation of SurvTRACE for survival analysis. It takes only a few lines of code to use:

```python
from survtrace.dataset import load_data
from survtrace.model import SurvTraceSingle
from survtrace import Evaluator
from survtrace import Trainer
from survtrace import STConfig

# use METABRIC dataset
STConfig['data'] = 'metabric'
df, df_train, df_y_train, df_test, df_y_test, df_val, df_y_val = load_data(STConfig)

# initialize model
model = SurvTraceSingle(STConfig)

# execute training
trainer = Trainer(model)
trainer.fit((df_train, df_y_train), (df_val, df_y_val))

# evaluating
evaluator = Evaluator(df, df_train.index)
evaluator.eval(model, (df_test, df_y_test))

print("done!")
```
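
The snippet above does not show what the evaluator reports. Survival models are commonly scored with a concordance index, the fraction of comparable subject pairs that the model ranks in the right order. The following is a minimal plain-NumPy sketch of Harrell's C-index, included only to illustrate what that kind of metric measures. It is not taken from the survtrace code, and the function name `concordance_index` and its arguments are assumptions made for this example.

```python
import numpy as np


def concordance_index(durations, events, risk_scores):
    """Harrell's C-index: share of comparable pairs ranked correctly.

    A pair (i, j) is comparable when subject i had an observed event
    (events[i] is truthy) strictly before subject j's recorded time.
    The pair is concordant when i also has the higher risk score.
    Ties in risk score count as half a concordant pair.
    """
    durations = np.asarray(durations, dtype=float)
    events = np.asarray(events, dtype=bool)
    risk_scores = np.asarray(risk_scores, dtype=float)

    concordant = 0.0
    comparable = 0
    for i in range(len(durations)):
        if not events[i]:
            continue  # a censored subject cannot be the earlier member of a pair
        for j in range(len(durations)):
            if durations[i] < durations[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable
```

A value of 1.0 means every comparable pair is ordered correctly, and 0.5 is what a constant or random score would give. The double loop is quadratic in the number of subjects, which is fine for a sanity check but slow on large cohorts.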

🔥 See the demo

Please refer to experiment_metabric.ipynb and experiment_support.ipynb.

🔥 How to configure the environment

Use our pre-saved conda environment:

```
conda env create --name survtrace --file=survtrace.yml
conda activate survtrace
```

Or install the dependencies from requirements.txt:

```
pip3 install -r requirements.txt
```
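
After either install route, a quick check that the key packages can be found may save a failed notebook run later. The sketch below uses only the standard library. The helper name `missing_packages` is hypothetical, and the package list in the usage part is an assumption; the authoritative list is whatever requirements.txt contains.

```python
import importlib.util


def missing_packages(names):
    """Return the top-level package names from `names` that cannot be located."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Assumed package list, for illustration only; see requirements.txt.
    required = ["torch", "numpy", "pandas"]
    missing = missing_packages(required)
    if missing:
        print("missing:", ", ".join(missing))
    else:
        print("all listed packages are importable")
```

Run it inside the activated environment so that it inspects the same interpreter the notebooks will use.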

🔥 How to get SEER data

Go to https://seer.cancer.gov/data/ and submit a data request to SEER, following the guide there.
Once the request is complete, you should have the seerstat software for data access. Open it and sign in with the username and password sent by SEER.

Use seerstat to open the ./data/seer.sl file.

Click the 'Execute' icon to query the SEER database. This produces a csv file.

Move the csv file to ./data/seer_raw.csv, then run the Python script process_seer.py:
```
python process_seer.py
```
This produces the processed SEER data, named seer_processed.csv.
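
Before training on the processed file, it can help to confirm that it was written and has the expected shape. The following is a small standard-library sketch; the helper name `summarize_csv` is hypothetical, and it assumes only that the file has a header row. The column names of seer_processed.csv are not listed here, so the sketch reports them rather than checking for specific ones.

```python
import csv


def summarize_csv(path):
    """Return (row_count, column_names) for a CSV file with a header row."""
    with open(path, newline="") as handle:
        reader = csv.reader(handle)
        header = next(reader, None)
        if header is None:
            raise ValueError(f"{path} is empty")
        # Count the remaining data rows without loading them all into memory.
        row_count = sum(1 for _ in reader)
    return row_count, header
```

For example, `rows, columns = summarize_csv("seer_processed.csv")`, with the path adjusted to wherever process_seer.py wrote the file.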

📝 Functions

- single event survival analysis
- competing events survival analysis
- multi-task learning
- automatic hyperparameter grid-search
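
To illustrate the last item, a grid search simply tries every combination of candidate hyperparameter values and keeps the one with the best validation score. The sketch below shows the general technique only; it is not the repo's implementation, and the names `expand_grid` and `grid_search`, as well as the hyperparameter keys in the usage note, are assumptions made for this example.

```python
from itertools import product


def expand_grid(search_space):
    """Yield one dict per combination of the candidate values in `search_space`."""
    names = list(search_space)
    for values in product(*(search_space[name] for name in names)):
        yield dict(zip(names, values))


def grid_search(search_space, score_fn):
    """Return (best_params, best_score); a lower score is treated as better."""
    best_params, best_score = None, float("inf")
    for params in expand_grid(search_space):
        score = score_fn(params)
        if score < best_score:
            best_params, best_score = params, score
    return best_params, best_score
```

In practice `score_fn` would train a model with the given parameters and return a validation loss. With a hypothetical space such as `{"learning_rate": [1e-3, 1e-4], "hidden_size": [16, 32, 64]}`, `expand_grid` yields six configurations, so the cost grows multiplicatively with each added hyperparameter.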

😄 If you find this result interesting, please consider citing this paper:

@article{wang2021survtrace,
      title={Surv{TRACE}: Transformers for Survival Analysis with Competing Events}, 
      author={Zifeng Wang and Jimeng Sun},
      year={2021},
      eprint={2110.00855},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}


