MedCAT for Heart Disease Concept NER and model fine-tuning

tomolopolis · March 31, 2022, 2:28pm

I’m interested in running MedCAT and extracting all Heart Disease concepts from some clinical text: NHSDigital SNOMED CT Browser

How do I load MedCAT with these concepts and extract them from text?

How do I collect training data and fine-tune a model for this use case?

dogs4lyfe · April 19, 2022, 12:21pm

Without going into too much detail. The steps are pretty straight forward. First build a model, then train (both in the semi-supervised and supervised training steps), then extract data.

This discourse group is pretty responsive so I tend to throw questions here and someone debugs my issues within the same day! Anyway:

Create your own model:

Construct a model. Vocab, cdb, configs etc…
Find a corpus of documents similar to the documents which you require information from and follow the Unsupervised training steps.

Once you have a pretrained model, time to fine-tune it…

Pre-trained Model:

Fine tune the model through: Use MedCATtrainer to create a labelled dataset. Supervised training and fine-tuning + Meta-annotations. Also use this labelling step to create a training dataset for our own customisable meta-annotations.
Run your model and annotate documents with the full MedCAT pipeline with MetaAnnotations
Create fancy visualisations of the insights from big data.
Show off your work to the MedCAT community through this discourse group

Topic		Replies	Views
Adding new concepts to a trained model or re-training a MedCAT model MedCAT	9	373	January 30, 2023
Medcat 1.7.0 trained on documents, or sentences (short documents) MedCAT	1	213	March 30, 2023
Medcat trained models issues MedCAT	5	302	January 16, 2024
How to improve recall and make medcat find correct word combinations?	15	315	January 20, 2023
MedCAT French model only matches exact terms - accuracy similarity always 1 MedCAT	7	63	June 8, 2025

MedCAT for Heart Disease Concept NER and model fine-tuning

Related topics