Asymmetrical estimator for training encapsulated deep photonic neural networks

Wang, Yizhi; Chen, Minjia; Yao, Chunhui; Ma, Jie; Yan, Ting; Penty, Richard; Cheng, Qixiang

arXiv:2405.18458 (cs)

[Submitted on 28 May 2024 (v1), last revised 15 Aug 2024 (this version, v2)]

Abstract: Scalable isomorphic physical neural networks (PNNs) are an emerging NN acceleration paradigm, valued for their high-bandwidth, in-propagation computation. Although backpropagation (BP)-based training is often the industry standard for its robustness and fast gradient convergence, existing BP-PNN training methods must truncate the propagation of the analogue signal at each layer and acquire accurate hidden-neuron readouts for deep networks. This undermines the incentive for using PNNs, namely fast in-propagation processing. In addition, the required readouts introduce severe bottlenecks, because each one requires a conversion across the analogue-digital interface to shuttle information between domains. These factors limit both time and energy efficiency during training. Here we introduce the asymmetrical training (AT) method, a BP-based method that can train an encapsulated deep network, in which information propagation stays within the analogue domain until the output layer. AT's minimal information access bypasses the analogue-digital interface bottleneck wherever possible. For any deep network structure, AT offers significantly improved time and energy efficiency compared with existing BP-PNN methods, and it scales well to large network sizes. We demonstrate AT's error-tolerant and calibration-free training for encapsulated integrated photonic deep networks, achieving near-ideal BP performance. AT's well-behaved training is demonstrated repeatably across different datasets and network structures.

Comments:	21 pages, 6 figures
Subjects:	Machine Learning (cs.LG); Optics (physics.optics)
MSC classes:	78-05
Cite as:	arXiv:2405.18458 [cs.LG]
	(or arXiv:2405.18458v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.18458

Submission history

From: Yizhi Wang
[v1] Tue, 28 May 2024 17:27:20 UTC (1,386 KB)
[v2] Thu, 15 Aug 2024 10:58:17 UTC (1,788 KB)
