Intelligence obtained by applying data mining to a database of French theses on the subject of Brazil

The subject of Brazil was analyzed within the French database DocThéses, covering the years 1969-1999. Data mining was used to obtain intelligence and infer knowledge. The objective was to identify indicators concerning: the occurrence of theses by subject area...

Bibliographic Details
Main Authors: Kira Tarapanoff, Luc Quoniam, Rogério Henrique de Araújo Júnior, Lillian Alvares
Format: Article
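Language: English
Published: University of Borås 2001-01-01
Series: Information Research: An International Electronic Journal
Subjects: data mining; bibliometrics; bibliometric analysis; French; theses; Brazil; knowledge; databases; Zipf's Law
Online Access: http://informationr.net/ir/7-1/paper117.html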
collection DOAJ
description The subject of Brazil was analyzed within the French database DocThéses, covering the years 1969-1999. Data mining was used to obtain intelligence and infer knowledge. The objective was to identify indicators concerning: the occurrence of theses by subject area; thesis supervisors identified with certain subject areas; the geographical distribution of cities hosting the institutions where the theses were defended; and the frequency of theses by subject area over the period in which they were defended. The data mining technique is divided into stages that run from identification of the problem-object, through selection and preparation of the data, to analysis of the data. Infotrans was used to clean the DocThéses database, and Dataview was used to prepare the data. It should be pointed out that the knowledge extracted is directly proportional to the value and validity of the information contained in the database. The results of the analysis were illustrated using the assumptions of Zipf's Law in bibliometrics, classifying the information as trivial, interesting or 'noise' according to its frequency distribution. It is concluded that data mining combined with specialist software is a powerful ally of competitive intelligence at all levels of the decision-making process, including the macro level, since it can support the consolidation, investment and development of actions and policies.
format Article
id doaj-art-6910cc17812b430d8579d708a188d505
institution Kabale University
issn 1368-1613
language English
publishDate 2001-01-01
publisher University of Borås
record_format Article
series Information Research: An International Electronic Journal
topic data mining
bibliometrics
bibliometric analysis
French
theses
Brazil
knowledge
databases
Zipf's Law
url http://informationr.net/ir/7-1/paper117.html