Private KCH Model Description

Hideaki · May 17, 2023, 11:00am

We are using the Private KCH model below, but the model card does not have the descriptions on source ontology, training dataset, training algorithm and rationale behind CUI filters applied. Hope you could help.
KCH_model_card

We also used the public SNOMED MIMIC-III model, which has a bit more information. We’re trying to compare the two.
Public_snomed_model_card

Also, I understand that this public model does not have meta-annotation trained, may I check then what the following means?
"Status: ‘Detects is a concept affirmed or Negated/ Hypothetical’ "

We appreciate your help.

Thank you.

anthony.shek · May 18, 2023, 8:37am

Hi @Hideaki This KCH model was created prior to the creation of the modelcards. This model with the filter has some issues with it.

Please update to the latest KCH model. It has all of this information on it

The public model only has one meta-task: “Status”. The KCH one has 3: Experiencer, Presence, Time

Hideaki · May 18, 2023, 11:55am

Thank you, @anthony.shek.

Hope you can help with my understanding of with the meta-annotation result from the public model for this entity. Where does the confidence value come from and what is the purpose of ‘name’:‘Status’ (as we already know the task is ‘Status’ from the more superficial dictionary key)?.

anthony.shek · May 18, 2023, 12:32pm

To calculate a meta-model’s confidence or probability for a particular class, you would pass the logits corresponding to that class through the softmax function. The resulting value represents the model’s estimated probability or confidence for that class.

The softmax function ensures that the output values are non-negative and sum up to 1, making them interpretable as probabilities. (1 is most confident and 0 is least.) It is calculated here in the predict function.

As for the duplication of the name ‘Status’ I have no idea the rationale behind this. @tomolopolis has this structure got something to do with MCTrainer?

Hideaki · May 19, 2023, 2:18pm

Thank you, @anthony.shek.

May I check if you happen to have further information on the KCH data used for training, i.e. out of 17M documents and 8.8B tokens, what proportion is ICU, Neurology, primary care, psychiatry etc?

anthony.shek · May 22, 2023, 3:20pm

No idea this was not recorded during at the time of training. Is this something that you need?
I just you can technically retrieve this from KCH CogStack

Topic		Replies	Views
Public models for meta annotations MedCAT	5	284	April 16, 2025
MedCat meta annotation model poor functionality MedCAT	4	261	January 18, 2023
MedCAT French model only matches exact terms - accuracy similarity always 1 MedCAT	7	62	June 8, 2025
Medcat 1.7.0 trained on documents, or sentences (short documents) MedCAT	1	213	March 30, 2023
How to improve recall and make medcat find correct word combinations?	15	315	January 20, 2023

Private KCH Model Description

Related topics