Subject Domain Extraction and Classification Model

Time

From 2017-08 to 2017-12

Project Introduction

Aiming at the problem that the growing Chinese scientific and technological literature cannot be classified automatically. Manual reading and tagging will increase labor costs, and manual methods rely too much on the professional skills of practitioners.

Procedure

Therefore, we automatically classify the scientific and technological literature by designing the deep learning network, and the reference class is the CNKI. In other words, the project is an application of “Text Multi-label”. By completing the text multi-label network, we test and set different thresholds so that the distribution of scientific and technical documents is as close as possible to the original distribution, so as to improve the whole system.

Zhao Qiuhan
Zhao Qiuhan
Ph.d candidate

My research interests include Natural Language Processing, Deep Learning , Data Science and it’s application in Science Economy. If you get interets in my research topics, please contact me as zhaoqiuhan2019@outlook.com.

Related