Show simple item record

dc.contributor.authorAbou Chacra, David
dc.date.accessioned2023-04-27 18:15:08 (GMT)
dc.date.available2023-04-27 18:15:08 (GMT)
dc.date.issued2023-04-27
dc.date.submitted2023-04-20
dc.identifier.urihttp://hdl.handle.net/10012/19350
dc.description.abstractDeep learning has dominated the landscape of computer vision for the past decade. Deep learning networks are the top performers on a slew of computer vision challenges (e.g., object detection or image segmentation) and on the most popular datasets. They outperform other approaches by a large margin, each armed with their own tricks to improve upon their predecessors. However recent research highlights several short-comings of deep learning approaches, from poor generalization performance to the difficulty in understanding the rationale behind the decisions they make. More nuanced and human-like tasks such as visual relationship detection still prove difficult for deep learning networks as well. In this thesis we tackle the problem of scene graph generation: the task of generating a directed graph that describes the relationships between detected objects in an image. We empirically identify, highlight and discuss the shortcomings of modern deep learning approaches to this task along with the reasoning behind these failures. Scene graph generation relies on both object detection and visual relationship detection. Our experiments first tackle object detection (through its more advanced task of instance segmentation) in isolation, then explore visual relationship detection starting with its data and moving on to its deep learning based approaches. Finally we propose and implement Topological Relationship Fields, a novel approach that allows for representing and grounding relationships purely visually. We utilize this representation for a scene graph generation approach that builds upon our findings and tackles the problem radically differently than the current standard approaches.en
dc.language.isoenen
dc.publisherUniversity of Waterlooen
dc.subjectartificial intelligenceen
dc.subjectmachine learningen
dc.subjectcomputer visionen
dc.subjectvisual relationship detectionen
dc.subjectscene graphsen
dc.subjecthuman cognitionen
dc.subjectdataset understandingen
dc.subjectadversarial attacksen
dc.subjectdeep learningen
dc.subjectmodel explainabilityen
dc.subjectstatistical modellingen
dc.subjectinstance segmentationen
dc.subjectobject detectionen
dc.subjectnetwork generalizationen
dc.titleModern Object and Visual Relationship Detection in Images from a Critical, Cognitive and Data Perspectiveen
dc.typeDoctoral Thesisen
dc.pendingfalse
uws-etd.degree.departmentSystems Design Engineeringen
uws-etd.degree.disciplineSystem Design Engineeringen
uws-etd.degree.grantorUniversity of Waterlooen
uws-etd.degreeDoctor of Philosophyen
uws-etd.embargo.terms0en
uws.contributor.advisorZelek, John
uws.contributor.affiliation1Faculty of Engineeringen
uws.published.cityWaterlooen
uws.published.countryCanadaen
uws.published.provinceOntarioen
uws.typeOfResourceTexten
uws.peerReviewStatusUnrevieweden
uws.scholarLevelGraduateen


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record


UWSpace

University of Waterloo Library
200 University Avenue West
Waterloo, Ontario, Canada N2L 3G1
519 888 4883

All items in UWSpace are protected by copyright, with all rights reserved.

DSpace software

Service outages