Show simple item record

dc.contributor.authorZhang, Dake
dc.contributor.authorVakili Tahami, Amir
dc.contributor.authorAbualsaud, Mustafa
dc.contributor.authorSmucker, Mark
dc.date.accessioned2022-05-11 18:27:14 (GMT)
dc.date.available2022-05-11 18:27:14 (GMT)
dc.date.issued2022-07
dc.identifier.urihttps://doi.org/10.1145/3477495.3531812
dc.identifier.urihttp://hdl.handle.net/10012/18257
dc.descriptionPermission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org. SIGIR ’22, July 11–15, 2022, Madrid, Spain © 2022 Copyright held by the owner/author(s). Publication rights licensed to ACM. ACM ISBN 978-1-4503-8732-3/22/07...$15.00 https://doi.org/10.1145/3477495.3531812en
dc.description.abstractWhen searching the web for answers to health questions, people can make incorrect decisions that have a negative effect on their lives if the search results contain misinformation. To reduce health misinformation in search results, we need to be able to detect documents with correct answers and promote them over documents containing misinformation. Determining the correct answer has been a difficult hurdle to overcome for participants in the TREC Health Misinformation Track. In the 2021 track, automatic runs were not allowed to use the known answer to a topic’s health question, and as a result, the top automatic run had a compatibility-difference score of 0.043 while the top manual run, which used the known answer, had a score of 0.259. The compatibility-difference measures the ability of methods to rank correct and credible documents before incorrect and non-credible documents. By using an existing set of health questions and their known answers, we show it is possible to learn which web hosts are trustworthy, from which we can predict the correct answers to the 2021 health questions with an accuracy of 76%. Using our predicted answers, we can promote documents that we predict contain this answer and achieve a compatibility-difference score of 0.129, which is a three-fold increase in performance over the best previous automatic method.en
dc.description.sponsorshipNatural Sciences and Engineering Research Council of Canada, RGPIN-2020-04665, RGPAS- 2020-00080 || Mitacs || Compute Canada || University of Waterloo.en
dc.language.isoenen
dc.publisherACMen
dc.relation.urihttps://github.com/UWaterlooIR/golden-gazeen
dc.subjectHealth Misinformationen
dc.subjectStance Detectionen
dc.subjectWeb Searchen
dc.titleLearning Trustworthy Web Sources to Derive Correct Answers and Reduce Health Misinformation in Searchen
dc.typeConference Paperen
dcterms.bibliographicCitationDake Zhang, Amir Vakili Tahami, Mustafa Abualsaud, and Mark D. Smucker. 2022. Learning Trustworthy Web Sources to Derive Correct Answers and Reduce Health Misinformation in Search. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’22), July 11–15, 2022, Madrid, Spain. ACM, New York, NY, USA, 6 pages. https://doi.org/10.1145/3477495.3531812en
uws.contributor.affiliation1Faculty of Engineeringen
uws.contributor.affiliation1Faculty of Mathematicsen
uws.contributor.affiliation2David R. Cheriton School of Computer Scienceen
uws.contributor.affiliation2Management Sciencesen
uws.typeOfResourceTexten
uws.peerReviewStatusRevieweden
uws.scholarLevelFacultyen
uws.scholarLevelGraduateen


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record


UWSpace

University of Waterloo Library
200 University Avenue West
Waterloo, Ontario, Canada N2L 3G1
519 888 4883

All items in UWSpace are protected by copyright, with all rights reserved.

DSpace software

Service outages