ANONYMIZATION OF SENSITIVE DATA IN UNSTRUCTURED DOCUMENTS USING NLP

Journal: International Journal of Mechanical Engineering and Technology (IJMET) (Vol. 12, No. 04)

Publication Date:

Authors : ;

Page : 25-35

Keywords : CRF; NE; Natural Language Processing; unstructured data; anonymization

Abstract

Many researchers have advanced the anonymization of structured data using spreadsheet and database tools, where established algorithms and techniques can mask sensitive information. Anonymizing unstructured data remains a real challenge, however, because such data exists in many different forms. Natural Language Processing (NLP) is the subfield of AI that studies the interaction between human language and computers and focuses on enabling computers to understand and process human languages. We provide deeper insight into how NLP works and show how a system can understand unstructured text, extract sensitive data, and perform anonymization. The proposed procedure applies text anonymization to an individual's original unstructured medical records and releases the anonymized document to support researchers in further study or investigation while preserving the privacy of the individual concerned.
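
The extract-then-anonymize flow described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's method: the keywords suggest a CRF-based named-entity tagger, whereas this sketch assumes a few hand-written regular expressions as a stand-in for the recognition step. The names `PATTERNS`, `extract_entities`, and `anonymize`, and the entity labels, are hypothetical and chosen only for illustration.

```python
import re

# Assumption: simple regex patterns stand in for a trained named-entity
# recognizer (e.g. a CRF tagger). Labels and patterns are illustrative only.
PATTERNS = {
    "DATE": re.compile(r"\b\d{4}-\d{2}-\d{2}\b"),
    "PHONE": re.compile(r"\b\d{3}-\d{3}-\d{4}\b"),
    "NAME": re.compile(r"\b(?:Mr|Mrs|Ms|Dr)\.\s+[A-Z][a-z]+(?:\s+[A-Z][a-z]+)?"),
}


def extract_entities(text):
    """Return sorted, non-overlapping (start, end, label) spans."""
    spans = []
    for label, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            spans.append((match.start(), match.end(), label))
    spans.sort()

    # Keep the earliest span and drop any later span that overlaps it.
    kept = []
    last_end = 0
    for start, end, label in spans:
        if start >= last_end:
            kept.append((start, end, label))
            last_end = end
    return kept


def anonymize(text):
    """Replace each detected span with a placeholder such as [NAME]."""
    pieces = []
    cursor = 0
    for start, end, label in extract_entities(text):
        pieces.append(text[cursor:start])
        pieces.append(f"[{label}]")
        cursor = end
    pieces.append(text[cursor:])
    return "".join(pieces)


if __name__ == "__main__":
    record = ("Dr. Jane Smith saw the patient on 2021-03-14. "
              "Call 555-123-4567 for follow-up.")
    print(anonymize(record))
    # [NAME] saw the patient on [DATE]. Call [PHONE] for follow-up.
```

In a system closer to the one the abstract describes, `extract_entities` would presumably be backed by a statistical model rather than patterns, while the replacement step in `anonymize` could stay the same.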

Last modified: 2021-06-05 12:07:42