Medcat Trainer configuration

Gunavardhan · May 24, 2022, 8:53am

The MedCAT Trainer is taking a lot of time to load even though it is set in the high configuration. The configuration currently used is 8 CPU 46 GB RAM which seems to be insufficient. Any thoughts on the configuration required ?

anthony.shek · May 24, 2022, 10:58pm

Hi @Gunavardhan,

8 CPU 46 GB RAM, should be more than sufficient to run MedCATtrainer.
Can you tell me a few more details about the issue:

What exactly is slow/failing? Is it building the image? slow loading a project? admin page? etc…
Have you checked the resources allocated to your docker instance? CPU, Memory, swap etc…
If it is slow loading a project. How many documents have you assigned to a project?
Your deployed Version/tag of MedCATtrainer

Thanks

Gunavardhan · May 25, 2022, 5:46am

Hi @anthony.shek

It is slow in loading a project, In a project we are having 546 documents

medcat-trainer:v2.1.1

Thanks in advance

tomolopolis · May 25, 2022, 9:21am

Hi @Gunavardhan - that’s quite an old version.

The latest is v2.3.0, could you git pull and re-run docker-compose?

Gunavardhan · May 25, 2022, 9:36am

Hi @anthony.shek

Will try and come back to you

Thanks in advance

tomolopolis · May 25, 2022, 9:38am

Please note that the MedCAT library version is different between these versions of the the Trainer, so current MedCAT models you’ve got loaded into the trainer may not work with v2.3.0

kashmirabhake · July 11, 2022, 6:22am

Hi All,

Our MedCAT trainer version is 2.3.4. These are our findings:

The loading time of each note in the MedCAT Trainer depends upon the length of that particular note.
In a dataset we have length of clinical notes (count of characters in a note) varying from 122219 to 118. Hence, the loading time of note varies from 5 mins to 2-3 secs.
For notes with higher length - sifting from one annotated value to other in a clinical note, or performing any operation on the annotated term, the page renders unresponsive.

How can we tackle this issue of huge loading time without reducing the size of the clinical note?

anthony.shek · July 20, 2022, 7:50pm

Hi @kashmirabhake,

What are the PC hardware specs that you are running the MedCAT trainer on?

anthony.shek · July 20, 2022, 9:12pm

One could run it on a small 4 core, 8gb machine, but that would be slow if running the full snomed terminology medcat model, I.e. The 2 - 3gb model.

kashmirabhake · July 21, 2022, 9:14am

Hi @anthony.shek thank you for your inputs.
Our system configuration is: 8 CPU 46 GB RAM
We need to run the full medcat model for our requirement.
Here are some more additional findings:
The time taken to load a note depends on 2 factors:

Length of the individual note
Number of concept ids to be annotated in a note irrespective their occurrences in a note

Out trainers are facing difficulties in loading a large note and training it as the page renders unresponsive. Having said this we are only interested in only 50 semantic types(tuis) out of 127. We believe that filtering the semantic types might help us reduce the loading time. Is there a way to be able to have a filter for only those semantic types that I am interested in?

tomolopolis · July 21, 2022, 3:44pm

In terms of document loading time - if you don’t set a project filter cui list, either directly or via .json file, you’re then annotating for ‘all concepts’ as configured within the medcat model. This is not advised both from a it takes ages to load long documents, and is also very painful for a human being to annotate,

anthony.shek · July 22, 2022, 11:47am

Exactly. In the project annotate entities tab: Enter your white list concept filter as follows:

Topic		Replies	Views
Deployment of MedcatTrainer latest version MedCAT	2	229	May 18, 2023
Medecat Trainer Missing Annotations MedCAT	3	219	January 17, 2023
Error messages when uploading current UMLS full model to MedCATrainer MedCAT	5	63	January 13, 2025
How to improve recall and make medcat find correct word combinations?	15	331	January 20, 2023
Medcat 1.7.0 trained on documents, or sentences (short documents) MedCAT	1	226	March 30, 2023

Medcat Trainer configuration

Related topics