French presidential elections: What are the most efficient measures for tweets?

Bouillot, F. ; Poncelet, P. ; Roche, M. ; Ienco, D. ; Bigdeli, E. ; Matwin, S.

Type de document
Communication scientifique avec actes
Langue
Anglais
Affiliation de l'auteur
UNIVERSITE DE MONTPELLIER II UMR 5506 LIRMM MONTPELLIER FRA ; UNIVERSITE DE MONTPELLIER II UMR 5506 LIRMM MONTPELLIER FRA ; CIRAD UMR TETIS MONTPELLIER FRA ; IRSTEA MONTPELLIER UMR TETIS FRA ; UNIVERSITY OF OTTAWA CAN ; POLISH ACADEMY OF SCIENCES CANADA INSTITUTE FOR COMPUTER SIENCE UNIVERSITY OF OTTAWA CAN
Année
2012
Résumé / Abstract
Tweets exchanged over the Internet are an important source of information even if their characteristics make them difficult to analyze (e.g., a maximum of 140 characters; noisy data). In this paper, we address the problem of extracting relevant topics through tweets coming from different communities. More precisely we are interested to address the following question: which are the most relevant terms given a community. To answer this question we define and evaluate new variants of the traditional TF-IDF. Furthermore we also show that our measures are well suited to recommend a community affiliation to a new user. Experiments have been conducted on tweets collected during French Presidential and Legislative elections in 2012. The results underline the quality and the usefulness of our proposal.
Congrès
PLEAD 2012, 02/11/2012 - 02/11/2012, Ottawa, CAN

puce  Accés à la notice sur le site Irstea Publications / Display bibliographic record on Irstea Publications website

  Texte intégral / Full text

  Liste complète des notices de CemOA