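Thesis title (Chinese): | Research and Design of an Optoelectronic Hybrid Switching Mechanism for Array Processors |
Name: | |
Student ID: | 20207223076 |
Confidentiality level: | Confidential (open after 1 year) |
Thesis language: | chi |
Discipline code: | 085400 |
Discipline name: | Engineering - Electronic Information |
Student type: | Master's |
Degree level: | Master of Engineering |
Degree year: | 2023 |
Degree-granting institution: | Xi'an University of Science and Technology |
School/Department: | |
Major: | |
Research direction: | Integrated circuit design |
First supervisor name: | |
First supervisor affiliation: | |
Thesis submission date: | 2023-06-19 |
Thesis defense date: | 2023-06-02 |
Thesis title (foreign language): | Research and Design of Optoelectronic Hybrid Switching Mechanism for Array Processor |
Keywords (Chinese): | |
Keywords (foreign language): | Array processor ; Optoelectronic hybrid interconnection ; Optical circuit switching ; Optical packet switching ; H-tree network ; Reconfigurable |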
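Abstract (Chinese): |
The optical switching mechanism is a key part of on-chip interconnect communication, and the choice of mechanism largely determines network performance such as latency and throughput. However, neither optical circuit switching nor optical packet switching alone can meet the data transmission requirements of a reconfigurable array processor across its many application scenarios. This thesis therefore studies and designs an optoelectronic hybrid switching mechanism for array processors. The work covers four areas: the study and design of an optoelectronic hybrid H-tree network, the design and study of hybrid-switching routing switches, the study and design of inter-cluster data exchange, and the construction and FPGA testing of a Long Short-Term Memory (LSTM) model based on the hybrid switching mechanism. First, building on the reconfigurable-array H-tree topology studied by the project team, an optoelectronic hybrid H-tree interconnection network based on the tree topology is designed. Optoelectronic turning switches with the corresponding number of ports are designed according to the number of paths passing through each crossing node of the H-tree, so that signals can be steered as required. The optical network layer was tested in OMNeT++; the results show that the average conflict rate at inter-cluster transmission nodes is 15.6% in unicast mode and 23.9% in multicast mode. In unicast mode, at an information injection rate of 0.6, the proposed routing structure achieves average throughput 9.3% and 22.6% higher than the Corona and 2D Mesh routing networks, respectively. In multicast mode, at an information injection rate of 0.5, the proposed routing network achieves average throughput 31.2% and 27.3% higher than the Corona and 2D Mesh routing networks, respectively. Second, two routing switches are designed to meet the data transmission requirements of optical circuit switching and optical packet switching. The routing switch design covers the data transmission format and circuit structures including the routing control unit, hybrid switching unit, loopback cache unit, and data partitioning and reassembly unit. The optoelectronic hybrid routing switches were then tested on an FPGA and their performance analyzed. The results show a total LUT usage of 828 for the secondary routing switch and 783 for the central-node routing switch. In uniform mode, at an information injection rate of 0.6, the proposed routing structure achieves throughput 35.7%, 22.5%, and 8.5% higher than the CDVC, ViChaR, and EDVC F-R/W routing switches, respectively. In hotspot mode, at an information injection rate of 0.6, the proposed routing structure achieves throughput 44.2%, 15.4%, and 19.6% higher than the CDVC, ViChaR, and EDVC F-R/W routing switches, respectively. Third, for the multi-channel inter-cluster data exchange network, an inter-cluster transmission network that uses both optical circuit switching (OCS) and optical packet switching (OPS) is designed. A routing strategy switches flexibly between the OCS and OPS transmission modes, and four-port and five-port optical routers are designed for optoelectronic hybrid transmission. Experimental results show that, compared with on-chip optical interconnection structures of the same scale such as λ-Router, GWOR, and Light, the optical H-tree network improves the average signal-to-noise ratio across paths by 38.5%, 36.0%, and 17.1%, respectively. In uniform mode, at an information injection rate of 0.2, the proposed routing structure achieves normalized throughput 11.2% and 7.0% higher than the dynamic cache routing network and the grid routing network, respectively. In uniform mode, at an information injection rate of 0.3, the proposed routing structure reduces average information transmission delay by 5.5% and 10.5% relative to the multi-channel dynamic cache routing network and the grid routing network, respectively. Finally, an LSTM network model is built with the PyTorch framework; pruning and quantizing the model yields the parameters of a gas concentration prediction model, which are loaded into the FPGA. The activation function computing unit, network computing unit, and gate state update unit of the LSTM hardware module are designed. The LSTM network computing unit is implemented on the hybrid-switching H-tree routing structure and then subjected to functional testing and performance analysis. Experimental results show that the software and hardware predictions of gas concentration level are largely consistent: in 20 software/hardware comparisons, only one differed. With the hybrid-switching H-tree, the classification accuracy of the LSTM network computing unit for gas concentration level improves by up to 1.21%. |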
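The role of the crossing nodes in a tree-shaped network can be illustrated with a small sketch. It assumes, purely for illustration, that the clusters are the leaves of a complete binary tree with heap-style node numbering; the function name `htree_path` and this numbering are hypothetical and are not taken from the thesis, whose actual H-tree layout and port counts may differ.

```python
from typing import List


def htree_path(src: int, dst: int, num_leaves: int) -> List[int]:
    """Return the node IDs visited when routing from leaf `src` to leaf `dst`.

    Assumed numbering (illustrative only): the root is node 1, the children
    of node n are 2n and 2n + 1, and leaf k is node num_leaves + k.
    """
    if num_leaves < 1 or num_leaves & (num_leaves - 1):
        raise ValueError("num_leaves must be a power of two")
    if not (0 <= src < num_leaves and 0 <= dst < num_leaves):
        raise ValueError("leaf index out of range")

    a, b = src + num_leaves, dst + num_leaves
    up, down = [a], [b]
    # Both leaves sit at the same depth, so climbing in lockstep
    # meets at the lowest common ancestor (the crossing node).
    while a != b:
        a //= 2
        b //= 2
        up.append(a)
        down.append(b)
    # `up` and `down` both end at the common ancestor; keep it only once.
    return up + down[-2::-1]


if __name__ == "__main__":
    path = htree_path(0, 3, 4)
    print(path, "hops:", len(path) - 1)
```

Under these assumptions, traffic between neighbouring clusters turns at a low-level crossing node, while traffic between distant clusters must pass through the root, which is why the number of paths through each crossing node matters when sizing its switch.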
Abstract (foreign language): |
The optical switching mechanism is an important component of on-chip interconnect communication, and its choice greatly affects network performance such as latency and throughput. However, neither optical circuit switching nor optical packet switching alone can meet the data transmission requirements of reconfigurable array processors across their various application scenarios. Therefore, this thesis studies and designs an optoelectronic hybrid switching mechanism for array processors. The thesis focuses on four aspects: the research and design of an optoelectronic hybrid H-tree network, the design and research of hybrid-switching routing switches, the research and design of inter-cluster data interaction, and the construction and FPGA testing of a Long Short-Term Memory (LSTM) network based on the hybrid switching mechanism. Firstly, based on the reconfigurable array H-tree topology studied by the project team, an optoelectronic hybrid H-tree interconnection network architecture based on the tree topology is designed. The performance of the optical network layer was tested in OMNeT++, and the results show that the average conflict rate at inter-cluster transmission nodes is 15.6% in unicast mode and 23.9% in multicast mode. In unicast mode, at an information injection rate of 0.6, the proposed routing structure achieves average throughput 9.3% and 22.6% higher than the Corona and 2D Mesh routing networks, respectively. In multicast mode, at an information injection rate of 0.5, the proposed routing network achieves average throughput 31.2% and 27.3% higher than the Corona and 2D Mesh routing networks, respectively. Secondly, two types of routing switches are designed to meet the data transmission requirements of optical circuit switching and optical packet switching. 
The routing switch design mainly covers the data transmission format and circuit structures such as the routing control unit, hybrid switching unit, loopback cache unit, and data partitioning and reassembly unit. FPGA testing and performance analysis were then carried out on the optoelectronic hybrid routing switches. The results show that the total LUT usage of the secondary routing switch is 828, and the LUT usage of the central-node routing switch is 783. In uniform mode, at an information injection rate of 0.6, the proposed routing structure achieves throughput 35.7%, 22.5%, and 8.5% higher than the CDVC, ViChaR, and EDVC F-R/W routing switches, respectively. In hotspot mode, at an information injection rate of 0.6, the proposed routing structure achieves throughput 44.2%, 15.4%, and 19.6% higher than the CDVC, ViChaR, and EDVC F-R/W routing switches, respectively. Then, for the multi-channel inter-cluster data exchange network structure, an inter-cluster transmission network that uses both optical circuit switching and optical packet switching is designed. The experimental results show that, compared with on-chip optical interconnection structures of the same scale such as λ-Router, GWOR, and Light, the optical H-tree network improves the average signal-to-noise ratio of each path by 38.5%, 36.0%, and 17.1%, respectively. In uniform mode, at an information injection rate of 0.2, the proposed routing structure increases normalized throughput by 11.2% and 7.0% compared with the dynamic cache routing network and the grid routing network, respectively. In uniform mode, at an information injection rate of 0.3, the proposed routing structure reduces the average information transmission delay by 5.5% and 10.5% compared with the multi-channel dynamic cache routing network and the grid routing network, respectively. 
Finally, an LSTM network model was built with the PyTorch framework; the parameters of the gas concentration prediction model were obtained by pruning and quantizing the model and were then imported into the FPGA. The activation function computing unit, network computing unit, and gate state update unit of the LSTM hardware module are designed. The LSTM network computing unit is implemented on the hybrid-switching H-tree routing structure, and functional testing and performance analysis are performed on it. The experimental results show that the software and hardware results for gas concentration level prediction are largely consistent: only one of the 20 software/hardware comparisons differs. With the hybrid-switching H-tree, the classification accuracy of the LSTM network computing unit for gas concentration level improves by up to 1.21%. |
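The three LSTM hardware units named in the abstract correspond to the three stages of a standard LSTM time step, which can be sketched in plain Python. This is only the textbook floating-point formulation; the gate keys `"i"`, `"f"`, `"o"`, `"g"`, the function names, and the weight layout are assumptions made for illustration, and the thesis's pruned, quantized FPGA implementation is not reproduced here.

```python
import math
from typing import Dict, List, Sequence, Tuple

Matrix = List[List[float]]


def sigmoid(x: float) -> float:
    """Numerically stable logistic function (activation stage)."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def lstm_step(
    x: Sequence[float],
    h_prev: Sequence[float],
    c_prev: Sequence[float],
    weights: Dict[str, Matrix],
    biases: Dict[str, List[float]],
) -> Tuple[List[float], List[float]]:
    """One LSTM time step.

    weights[g] has one row per hidden unit and len(x) + len(h_prev)
    columns; g is one of "i" (input), "f" (forget), "o" (output),
    "g" (candidate). This layout is an assumption of the sketch.
    """
    z = list(x) + list(h_prev)

    def affine(gate: str) -> List[float]:
        # Matrix-vector product plus bias (network computation stage).
        return [
            sum(w * v for w, v in zip(row, z)) + bias
            for row, bias in zip(weights[gate], biases[gate])
        ]

    i = [sigmoid(v) for v in affine("i")]
    f = [sigmoid(v) for v in affine("f")]
    o = [sigmoid(v) for v in affine("o")]
    g = [math.tanh(v) for v in affine("g")]

    # Gate state update stage: new cell state and hidden state.
    c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c_prev, i, g)]
    h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
    return h, c


if __name__ == "__main__":
    W = {gate: [[0.1, -0.2]] for gate in "ifog"}
    b = {gate: [0.0] for gate in "ifog"}
    print(lstm_step([1.0], [0.0], [0.5], W, b))
```

In a hardware design, the sigmoid and tanh calls would typically be replaced by lookup tables or piecewise approximations on fixed-point values, which is one plausible source of the occasional software/hardware mismatch the abstract reports.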
Chinese Library Classification number: | TN492 |
Open-access date: | 2024-06-19 |