ملخص البحث :
Drug abuse pertains to the consumption of a substance that may induce adverse effects to a person. In
international security studies, drug trafficking has become an important topic. In this regard, drug-related
crimes are identified as an extremely significant challenge faced by any community. Several techniques for
investigations in the crime domain have been implemented by many researchers. However, most of these
researchers focus on extracting general crime entities. The number of studies that focus on the drug crime
domain is relatively limited. This paper mainly aims to propose a rule-based named entity recognition
model for drug-related crime news documents. In this work, a set of heuristic and grammatical rules is used
to extract named entities, such as types of drugs, amount of drugs, price of drugs, drug hiding methods, and
the nationality of the suspect. A set of grammatical and heuristic rules is established based on part-ofspeech
information, developed gazetteers, and indicator word lists. The combined approach of heuristic and
grammatical rules achieves a good performance with an overall precision of 86%, a recall of 87%, and an
F1-measure of 87%. Results indicate that the ensemble of both heuristic and grammatical rules improves
the extraction effectiveness in terms of macro-F1 for all entities
-
سنة النشر : 2015
-
تصنيف البحث : scopus
- تحميل