
Resource for the first rotation at Vinnova

Use case: ICT classification and topic modelling

At the beginning of the rotation, there was interest in identifying ICT-related projects in the Horizon 2020 health cluster. Because no labeled data was available, we took a keyword-based approach and compiled a list of keywords for finding ICT-related projects. We later refined the method by taking into account whether each project's topic was ICT-related and by counting the keyword matches in the project abstract.
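As a rough illustration of the keyword-based approach, the sketch below counts keyword matches in an abstract and combines that count with a flag for whether the project's topic is ICT-related. The keyword list, the function names, and the threshold rule (one match for projects under an ICT-related topic, two otherwise) are assumptions made for this example, not the actual list or rule used in the rotation.

```python
import re

# Hypothetical keyword list; the actual list used in the project is not given here.
ICT_KEYWORDS = ["software", "machine learning", "digital", "sensor", "algorithm"]


def count_keyword_matches(abstract, keywords=ICT_KEYWORDS):
    """Count whole-word, case-insensitive keyword occurrences in an abstract."""
    text = abstract.lower()
    total = 0
    for kw in keywords:
        # \b restricts matches to whole words, so "sensor" does not match "sensors".
        pattern = r"\b" + re.escape(kw.lower()) + r"\b"
        total += len(re.findall(pattern, text))
    return total


def is_ict_related(abstract, topic_is_ict, min_matches=2):
    """Flag a project as ICT-related.

    Assumed rule: a project under an ICT-related topic needs a single match;
    any other project needs at least `min_matches` matches.
    """
    threshold = 1 if topic_is_ict else min_matches
    return count_keyword_matches(abstract) >= threshold


print(is_ict_related("A digital tool based on machine learning.", topic_is_ict=False))
```

Whole-word matching is a deliberate simplification here: it avoids false hits inside longer words but misses plurals and inflected forms, so in practice the keyword list or the pattern would need to account for those.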

The next step was topic modelling, using both LDA and BERTopic, to visualize what the projects were about and which topic clusters could be identified. Lastly, we used network visualization to examine the relationships between the actors taking part in these projects.
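One common way to prepare data for a network view of project actors is to build a co-participation edge list, where two actors are linked whenever they take part in the same project and the edge weight is the number of projects they share. The sketch below shows that step using only the standard library. The project and actor names are invented for the example, and the approach is an assumption about how such a network could be built, not a description of the method used in the rotation.

```python
from collections import Counter
from itertools import combinations


def co_participation_edges(projects):
    """Return a Counter mapping (actor_a, actor_b) to their number of shared projects.

    `projects` maps a project id to the list of actors taking part in it.
    Actor pairs are stored in alphabetical order so each pair has a single key.
    """
    edges = Counter()
    for participants in projects.values():
        # set() drops duplicate actors within one project; sorted() fixes pair order.
        for a, b in combinations(sorted(set(participants)), 2):
            edges[(a, b)] += 1
    return edges


# Hypothetical example data.
projects = {
    "P1": ["Univ A", "Company B", "Hospital C"],
    "P2": ["Univ A", "Company B"],
    "P3": ["Hospital C", "Institute D"],
}

edges = co_participation_edges(projects)
for (a, b), weight in sorted(edges.items()):
    print(a, "--", b, "shared projects:", weight)
```

The resulting weighted edge list can then be passed to a graph library or visualization tool of choice to draw the network.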

Attributes

NLP
Textual Data