Published : 2015-12-30

News analysis with natural language processing techniques

Tomasz Kowalski



Mateusz Molek



Abstract

The dynamic development of natural language processing results in a growing number of products utilizing so-called speech and language technologies. On the one hand this refers to the possibility of interacting with a computer using a language that people naturally use in speech and writing; one the other – making the information contained in all sorts of texts accessible for a computer.

We present how methods for gathering and extracting information can be applied to news releases, to possibly reduce the overhead generated by republishing the same news by numerous internet information portals.

We present how web syndication can be used to gather press releases; how to process those texts in order to determine mutual similarity; and how to visualize those. We present preliminary results of an experiment with application of the above-mentioned methods to selected Polish internet portals.

Keywords:

press releases, information retrieval, information deduplication



Details

References

Statistics

Authors

Download files

pdf. (Język Polski)

Citation rules

Kowalski, T., & Molek, M. (2015). News analysis with natural language processing techniques. Fides, Ratio Et Patria. Studia Toruńskie, (3), 44–59. https://doi.org/10.56583/frp.1976

Altmetric indicators


Cited by / Share


Publisher
Wydawnictwo Akademii Zamojskiej
ul. Pereca 2, 22-400 Zamość
tel.: +48 84/638 34 44;
tel. kom. +48/ 790 331 087
fax: +48 84/ 638 35 00
University
Akademia Zamojska
ul. Pereca 2, 22-400 Zamość
tel. 84 638 34 44
fax 84 638 35 00
e-mail: rektorat@akademiazamojska.edu.pl
About:
Copyright 2021 by
OJS Support and Customization by LIBCOM
Platform & workfow by OJS/PKP