• Photonics Research
  • Vol. 13, Issue 4, 915 (2025)
Zheng Huang1,2, Conghe Wang1,2, Caihua Zhang1,2, Wanxin Shi3..., Shukai Wu1,2, Sigang Yang1,2 and Hongwei Chen1,2,*|Show fewer author(s)
Author Affiliations
  • 1Department of Electronic Engineering, Tsinghua University, Beijing 100084, China
  • 2Beijing National Research Center for Information Science and Technology, Beijing 100084, China
  • 3China Mobile Research Institute, Beijing 100053, China
  • show less
    DOI: 10.1364/PRJ.539630 Cite this Article Set citation alerts
    Zheng Huang, Conghe Wang, Caihua Zhang, Wanxin Shi, Shukai Wu, Sigang Yang, Hongwei Chen, "Brain-like training of a pre-sensor optical neural network with a backpropagation-free algorithm," Photonics Res. 13, 915 (2025) Copy Citation Text show less
    Paradigms for training ONNs: in-silico training, in-situ training, combined training.
    Fig. 1. Paradigms for training ONNs: in-silico training, in-situ training, combined training.
    Optimization of the ONNs. (a) The schematic of the training process with FFA. (b) The method for constructing positive and negative samples, using handwritten numbers as an example.
    Fig. 2. Optimization of the ONNs. (a) The schematic of the training process with FFA. (b) The method for constructing positive and negative samples, using handwritten numbers as an example.
    (a) Prototype of the system. (b) Corresponding network structure diagram of the system. (c) Confusion matrix in the classification of MNIST dataset, for the condition that only black box exists in the pre-sensor processing. (d) Confusion matrix in the classification of MNIST dataset, for the condition that black box and SLM coexist in the pre-sensor processing. (e) The optimized pattern of SLM. (f) The visualization of the output after black box. The samples represent the result of the convolutional layer formed by the mask and the nonlinear layer provided by the image intensifier. (g) Accuracy and parameters comparison across different models. The output neurons of the digital linear layer vary from 16 to 24. (h) The performances for the classification of CIFAR-4 dataset. The processing approaches contain only digital computing and the combination of optical and digital computing. (i) The optical nonlinear response of the image intensifier used in the experiments.
    Fig. 3. (a) Prototype of the system. (b) Corresponding network structure diagram of the system. (c) Confusion matrix in the classification of MNIST dataset, for the condition that only black box exists in the pre-sensor processing. (d) Confusion matrix in the classification of MNIST dataset, for the condition that black box and SLM coexist in the pre-sensor processing. (e) The optimized pattern of SLM. (f) The visualization of the output after black box. The samples represent the result of the convolutional layer formed by the mask and the nonlinear layer provided by the image intensifier. (g) Accuracy and parameters comparison across different models. The output neurons of the digital linear layer vary from 16 to 24. (h) The performances for the classification of CIFAR-4 dataset. The processing approaches contain only digital computing and the combination of optical and digital computing. (i) The optical nonlinear response of the image intensifier used in the experiments.
    Experiments for Fashion MNIST dataset. (a) Schematic diagram of the system. The input includes positive and negative samples. (b) Corresponding network structure diagram of the system. “LM” means “linear modulation.” (c) The optimized pattern of SLM in the first optical linear layer of the Hadamard product. (d) The optimized pattern of SLM in the second optical linear layer of Hadamard product. (e) Confusion matrix in the classification of Fashion MNIST dataset, for the condition that the input images are processed sequentially through the black box and SLM in two cycles.
    Fig. 4. Experiments for Fashion MNIST dataset. (a) Schematic diagram of the system. The input includes positive and negative samples. (b) Corresponding network structure diagram of the system. “LM” means “linear modulation.” (c) The optimized pattern of SLM in the first optical linear layer of the Hadamard product. (d) The optimized pattern of SLM in the second optical linear layer of Hadamard product. (e) Confusion matrix in the classification of Fashion MNIST dataset, for the condition that the input images are processed sequentially through the black box and SLM in two cycles.
    ProcessingTest AccuracyNetwork (FC)FLOPs
    FC-BP89.0%(784,22,10)3.5 × 104
    FC-FFA88.5%(1024,20)4.1 × 104
    Black Box (optics) + FC-FFA92.5%(900,20)3.6 × 104
    Black Box (optics) + Linear (optics) + FC-FFA-sim94.0%(900,20)3.6 × 104
    Black Box (optics) + Linear (optics) + FC-FFA-exp93.0%(900,20)3.6 × 104
    Table 1. Comparison of Accuracies between Different Processing Approaches in the Classification of the MNIST Dataset
    ProcessingTest AccuracyNetwork (FC)FLOPs
    FC-BP81.0%(784,22,10)3.5 × 104
    FC-FFA78.5%(1024,20)4.1 × 104
    Black Box (optics) + FC-FFA80.0%(900,20)3.6 × 104
    Black Box (optics) + Linear (optics) + FC-FFA86.0%(900,20)3.6 × 104
    Black Box (optics) + Linear (optics) + Black Box (optics) + FC-FFA87.0%(900,20)3.6 × 104
    Black Box (optics) + Linear (optics) + Black Box (optics) + Linear (optics) + FC-FFA87.5%(900,20)3.6 × 104
    Table 2. Comparison of Accuracies between Different Processing Approaches in the Classification of Fashion MNIST Dataset
    Zheng Huang, Conghe Wang, Caihua Zhang, Wanxin Shi, Shukai Wu, Sigang Yang, Hongwei Chen, "Brain-like training of a pre-sensor optical neural network with a backpropagation-free algorithm," Photonics Res. 13, 915 (2025)
    Download Citation