Categorize "scientific"/"non scientific" medical documents #4

asittampalam · 2017-03-17T10:15:51Z

In a second step we could extract the "scientific" medical documents from our positive set.

tschimbr · 2017-03-17T10:18:05Z

In order to do this we will label top level domains as scientific / pseudo-scientific / trivial by an expert (medical doctor, coder..)

tschimbr · 2017-03-17T12:13:58Z

Use these two sets in order to create a translation service from professional scientific texts to non scientific texts easily understandable by patients.

asittampalam · 2017-07-21T14:49:20Z

Labeled as scientific (to be updated):
springer.com
pharmazeutische-zeitung.de
springermedizin.at
med2click.de
pathologie-online.de
clinicum.at

Labeled as non-scientific (to be updated):
netdoktor.de
diabetes-ratgeber.net
planet-wissen.de
focus.de
spektrum.de
gesundheitsinformation.de
medizin-transparent.at
haut-ratgeber.ch

asittampalam · 2017-07-21T15:29:10Z

Maybe we could use something like https://link.springer.com/article/10.1023%2FA%3A1007692713085?LI=true (Text Classification from Labeled and Unlabeled Documents using EM - I haven't read it yet) in order to start with a small labeled set (e.g. part of "scientific") and to use a large unlabeled set (e.g. "scientific" + "non-scientific") as leverage in order to learn a stable "scientific"/"non-scientific" classifier.

tschimbr · 2019-04-02T08:39:16Z

Create a data set with pairs of synonyms, one being scientific, the other being non-scientific:

calculate the vector embedding difference between these synonyms
How different are the differences?
average the differences
translate new scientific words to non-scientific words

Maybe test out in https://github.com/eonum/medword

asittampalam self-assigned this Mar 17, 2017

tschimbr unassigned asittampalam Oct 14, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Categorize "scientific"/"non scientific" medical documents #4

Categorize "scientific"/"non scientific" medical documents #4

asittampalam commented Mar 17, 2017

tschimbr commented Mar 17, 2017

tschimbr commented Mar 17, 2017

asittampalam commented Jul 21, 2017

asittampalam commented Jul 21, 2017 •

edited

Loading

tschimbr commented Apr 2, 2019 •

edited

Loading

Categorize "scientific"/"non scientific" medical documents #4

Categorize "scientific"/"non scientific" medical documents #4

Comments

asittampalam commented Mar 17, 2017

tschimbr commented Mar 17, 2017

tschimbr commented Mar 17, 2017

asittampalam commented Jul 21, 2017

asittampalam commented Jul 21, 2017 • edited Loading

tschimbr commented Apr 2, 2019 • edited Loading

asittampalam commented Jul 21, 2017 •

edited

Loading

tschimbr commented Apr 2, 2019 •

edited

Loading