Please use this identifier to cite or link to this item: http://idr.iimranchi.ac.in:8080/xmlui/handle/123456789/235
Full metadata record
DC Field | Value | Language
dc.contributor.author | Mukherjee, Shubhadeep. | -
dc.contributor.author | Bala, Pradip Kumar. | -
dc.date.accessioned | 2018-04-04T09:11:23Z | -
dc.date.available | 2018-04-04T09:11:23Z | -
dc.date.issued | 2017-02 | -
dc.identifier.citation | Mukherjee, S., & Bala, P.K. (2017). Gender classification of microblog text based on authorial style. Information Systems and e-Business Management, 15(1), 117-138. doi: https://doi.org/10.1007/s10257-016-0312-0. | en_US
dc.identifier.issn | 1617-9846 | -
dc.identifier.uri | https://doi.org/10.1007/s10257-016-0312-0 | -
dc.identifier.uri | http://10.10.16.56:8080/xmlui/handle/123456789/235 | -
dc.description.abstract | Gender profiling of unstructured text data has several applications in areas such as marketing, advertising, legal investigation, and recommender systems. The automatic detection of gender in microblogs, such as Twitter, is a difficult task. It requires a system that can use knowledge to interpret the linguistic styles being used by the genders. In this paper, we try to provide this knowledge for such a system by considering different sets of features, which are relatively independent of the text, such as function words and part of speech n-grams. We test a range of different feature sets using two different classifiers, namely Naïve Bayes and maximum entropy algorithms. Our results show that the gender detection task benefits from the inclusion of features that capture the authorial style of the microblog authors. We achieve an accuracy of approximately 71%, which outperforms the classification accuracy of commercially available gender detection software like Gender Genie and Gender Guesser. | en_US
dc.language.iso | en | en_US
dc.publisher | Springer | en_US
dc.subject | Text mining | en_US
dc.subject | Twitter | en_US
dc.subject | Natural language processing | en_US
dc.subject | Gender classification | en_US
dc.subject | Knowledge discovery | en_US
dc.subject | Supervised learning | en_US
dc.subject | Artificial intelligence | en_US
dc.subject | Business intelligence | en_US
dc.subject | IIM Ranchi | en_US
dc.title | Gender classification of microblog text based on authorial style | en_US
dc.type | Article | en_US
dc.volume | 15 | en_US
dc.issue | 1 | en_US
Appears in Collections: Journal Articles

Files in This Item:
There are no files associated with this item.

