dc.contributor.author | Mohammed Sajjad Jafri, Mohammed Sajjad Jafri | |
dc.date.accessioned | 2024-03-22 15:50:42 (GMT) | |
dc.date.available | 2024-03-22 15:50:42 (GMT) | |
dc.date.issued | 2024-03-22 | |
dc.date.submitted | 2024-03-19 | |
dc.identifier.uri | http://hdl.handle.net/10012/20403 | |
dc.description.abstract | In this thesis, we introduce a centralized performance monitoring infrastructure. In the current computing landscape, performance monitoring architectures are becoming more and more important for different academic and industrial applications. Performance counters reveal valuable insight into the functioning of the platform. This information can then be exploited for debugging applications, improving performance, identifying bottlenecks, and much more. In our proposed infrastructure, we envision a configurable Advanced Performance Monitoring Unit (APMU) connected to a set of monitoring Event Units (EVU) that are installed in various hardware system IPs across the platform. These EVUs send hardware event information to the APMU. The APMU has smart counters that are capable of operating on the incoming events, and an instruction processor that can implement any desired software mechanisms on the counter data. Our design allows for an efficient collection and correlation of event data, allowing the APMU to get a more holistic insight into the system behaviour, revealing microarchitecture-specific information. We intend to allow users the ability to develop EVUs for IPs relevant to them. For instance, in the implementation phase of this work, we developed an AXI4-based Snooping Unit as a concrete example of a custom-EVU. Therefore, to help integrate such custom EVUs with an APMU infrastructure, we also standardize an EVU-APMU interface. We provide the specification for this interface, ensuring that users can connect any custom-EVU to an APMU, as long as both abide by the interface specification.
In this work, we implement two design IPs. One is the previously mentioned AXI4-based Snooping Unit and the other is a RISC-V compliant APMU. We also provide a software stack to support programming on its processor. The implemented design is emulated on an AMD Virtex UltraScale+ FPGA VCU118 device. To evaluate the implementation of our design, we present the hardware synthesis results for the FPGA, and the execution results of a latency-based regulation case study, demonstrating the functionality of our design. | en |
dc.language.iso | en | en |
dc.publisher | University of Waterloo | en |
dc.subject | Performance Monitoring Infrastructure | en |
dc.subject | RISCV | en |
dc.subject | FPGA | en |
dc.subject | Resource regulation | en |
dc.subject | AXI4 Protocol | en |
dc.subject | Memory contention | en |
dc.subject | Event units | en |
dc.subject | Advanced Performance Monitoring Unit | en |
dc.title | A Centralized System Performance Monitoring Infrastructure | en |
dc.type | Master Thesis | en |
dc.pending | false | |
uws-etd.degree.department | Electrical and Computer Engineering | en |
uws-etd.degree.discipline | Electrical and Computer Engineering | en |
uws-etd.degree.grantor | University of Waterloo | en |
uws-etd.degree | Master of Applied Science | en |
uws-etd.embargo.terms | 0 | en |
uws.contributor.advisor | Pellizzoni, Rodolfo | |
uws.contributor.affiliation1 | Faculty of Engineering | en |
uws.published.city | Waterloo | en |
uws.published.country | Canada | en |
uws.published.province | Ontario | en |
uws.typeOfResource | Text | en |
uws.peerReviewStatus | Unreviewed | en |
uws.scholarLevel | Graduate | en |