
- Chinese Optics Letters
- Vol. 22, Issue 11, 110602 (2024)
Abstract
1. Introduction
Due to the capability of carrying orbital angular momentum (OAM) and possessing distinctive polarization characteristics, vector vortex beams (VVBs) are widely applied in the fields of super-resolution microscopy[1,2], molecule orientation imaging[3], and mode-division multiplexing (MDM)[4–6]. In particular, focusing radially polarized light can create a strong longitudinally electrical component at the focus, while tightly focused azimuthally polarization leads to a strong axial magnetic field and a vanishing electric field[2,3]. Among them, cylindrical vector beams, as orthogonal eigenmodes, offer more available high-order modes, which can further expand the communication capacity in multi-mode fibers. Through vector-eigenmodes (VMs) modulation, linearly polarized (LP) or circularly polarized (CP) OAM can be exploited as information channels[7]. However, due to mode coupling and interference in fibers, hybrid VMs are inevitably involved in the transmission. Therefore, there is a challenge in identifying modal coefficients in the fields of optical communication and fiber MDM.
In recent years, modal decomposition (MD) techniques based on numerical methods, such as inverse matrix decomposition[8,9] and the stochastic parallel gradient descent (SPGD) algorithm[10,11], have rapidly become essential tools for revealing the characteristics of optical field modes. However, they may suffer from the drawback of falling into local optimal solutions. Meanwhile, deep learning (DL) algorithms, especially convolutional neural networks (CNN), have shown promise in the field of optics due to their powerful global search capabilities[12–14], but most of them are retrieved based on only-intensity beam distributions. More polarization-projected intensity distributions azimuthally across the transversal beam are necessary to differentiate high-order degenerate VMs, which will lead to training computational costs for the DL process increasing greatly. The common concern is that DL-based networks are typically trained on simulated data and then applied to experimental data. Due to the lack of an equivalent relationship between simulated and experimental data, the reliability of experimentally reconstructed modal results cannot be guaranteed.
Here, to address these issues, we first attempt to validate the modal coefficients of the measured hybrid VVBs using the DL-based SPGD algorithm by retrieving multiplex polarized intensities, instead of only considering the beam intensity. Leveraging the correspondence between the waveplate angles and the modal coefficients, as well as polarization characteristics of the hybrid VMs, Xception models with different channel numbers are trained based on the simulation data and selected to decompose the intensity distribution of four polarization directions. In combination with the SPDG algorithm, Xception-based predictions of modal coefficients and modal intensity reconstruction in VM and OAM bases are obtained. This method successfully makes an accurate prediction of modal coefficients, which is validated through theoretical calculations based on the waveplate angles, demonstrating a high correlation of up to 99.65%. Second, modal coefficients of typically mixed VVBs are qualitatively analyzed in the form of VM and OAM mode bases. Finally, modal coefficients of first-order and second-order degenerate eigenmodes in few-mode fibers (FMF) by all-fiber mode-selective couplers (MSCs) are reconstructed. The universality of the DL-SPGD algorithm is demonstrated through the precise reconstruction of high-order VMs, which greatly promotes the accurate characteristics of VVBs in laser beam characterization, MDM, and other optical communication fields.
2. Verification Experiment and Results
The first-order VM division multiplexing setup based on a vortex waveplate (VWP) is constructed based on the configuration of a Mach–Zehnder interferometer, as shown in Fig. 1(a). The fundamental mode Gaussian beam (
Figure 1.(a) Diagram of the experimental device. (b) The relationship between the VMs and the angle of Pol1. (θ1) and the angle of HWP (θ2). TEM00, fundamental mode; Pol, polarizer; VWP, vortex waveplate; BS, non-polarizing beam splitter; HWP, half-wave plate; CCD, charge-coupled device.
The transformation relationship between the VMs by changing the waveplate angles is shown in Fig. 1(b). Relative to the fixed VWP,
In this Letter, Xception is chosen as the CNN model because of its unique network structure. It has depthwise separable convolutions, which can significantly reduce the number of parameters and improve the training efficiency[17]. Because the VMs have non-uniform spatial polarization characteristics, the training images with the resolution of
Figure 2.Overall flowchart of the DL-SPGD algorithm. A: 1-channel multi-view input; B: N-channel input. Conv.: Convolution layer, Pool.: Pooling layer, Separable Conv.: depthwise separable convolution layer.
The entire flow of the DL-SPGD algorithm is shown in Fig. 2. The dataset of 40,000 random patterns is first trained in the Xception model for testing prediction. Then, the rough modal coefficients
First, the performance comparison of different channel numbers is presented in Table 1. Validation loss is a crucial metric for evaluating the performance of network models, which are calculated here using the mean squared error between the output and label vectors.
Validation Loss | Mean Modal Error | Correlation (%) | |
---|---|---|---|
4-channel | 6.8 × 10−5 | 1.2 × 10−2, 1.1 × 10−2 | 98.96 |
1-channel | 3.5 × 10−5 | 7.7 × 10−3, 8.3 × 10−3 | 99.28 |
Table 1. Performance Comparison of Channel Number
Based on the performance evaluation and polarization characteristics of the VMs, the validation loss curves are seen in Fig. 3(a), which show a fast convergence after about 90 epochs and a similar loss trend for the first- and second-order VMs. The results indicate that the feature of the matrix pattern can be learned well with a low loss. Another dataset with 1000 polarized intensity pattern is tested, and the average modal error of weights and phases is lower than 0.89% and 0.76%, as shown in Fig. 3(b). It is worth noting that the modal coefficient errors of low-order VMs are smaller than those of high-order ones. This might be understood because high-order VMs possess complicated intensity and polarization distribution.
Figure 3.(a) Validation loss for first- and second-order VMs. (b) Their average errors of modal weights and relative phases. (c) The averaged correlations of the SPGD and the DL- SPGD algorithms for first-order VMs over 200 testing samples. (d) Given and predicted weights and relative phases for first-order VMs.
Second, compared to that from the SPGD algorithm with inputting random coefficients, the predicted modal ones from the Xception-based DL process can not only efficiently acquire convergence with fewer iterations but also improve the accuracy of correction, as shown in Fig. 3(c). The comparisons between given and predicted modal coefficients using DL-SPGD algorithms for first-order VMs are shown in Fig. 3(d). It can be seen that the modal coefficients reconstructed by the DL-SPGD algorithm are closer to the given ones, and the correlation reaches 99.997%. This is due to the fact that the DL-SPGD algorithm perfectly combines the global optimization of DL and the accurate research capability of the SPGD.
Third, we attempt to experimentally validate the modal coefficients of measured VMs through DL-based SPGD algorithm. The modal weights can be calculated based on the Jones Matrix transformation of waveplates at different angles (
Figure 4.(a) Measured (Mea.) and reconstructed (Rec.) intensity patterns of a VM by the DL-SPGD algorithm. The measured and predicted modal coefficients in (b) the VM, (c) the CP-OAM, and (d) the LP-OAM mode bases.
Fourth, the predicted and measured modal coefficients are consistent at any admixture of VMs, which further validates the efficient decomposition using the DL-SPGD algorithm. As shown in Fig. 5, the mixed
Figure 5.Predicted (red dots and blue triangles) and measured (blue and red lines) modal weights. (a) Mixed TM01 and TE01 modes, (b) mixed HE21even and HE21odd modes. Hybrid four VMs when (c) θ1 = 0 and (d) θ1 = π/4. (e1), (f1) The output and polarized patterns of the VMs. Interference patterns when the polarized angles are set at (e2), (f2) 0° and (e3), (f3) 90°. (Mes., measured modal coefficients; Pre., predicted modal coefficients.
Two typical mixed VMs by mode superposition are marked as points A and B in Figs. 5(c) and 5(d), which are considered as degenerate odd and even modes of the same order VMs with a phase difference of
3. Experiment of Recognition in All-Fiber MSCs
Finally, through the precise reconstruction of high-order VM generation using all-fiber MSCs, we demonstrate high-order VM decomposition based on the DL-SPGD algorithm. The measured and predicted patterns of the first- and second-order hybrid VMs are shown in Figs. 6(a1) and 6(b1), respectively. It is found that their measured beam profiles have great consistency with the reconstructed ones. Among them, their intensity distributions with spatially LP-mode separations are non-uniform and polarization-dependent, which can be reconstructed more accurately using multi-view images. The predicted modal coefficients of the first-order VMs are shown in Fig. 6(a2), in which
Figure 6.Experimental (Exp.) and reconstructed (Rec.) intensity patterns of (a1) first-order and (b1) second-order VMs converted from fiber MSCs. Their predicted modal weights and phases of (a2), (b2) VM modes, (a3), (b3) CP-OAM base, and (a4), (b4) LP-OAM base.
4. Conclusion
In summary, the applicability of the DL-SPGD algorithm to high-order vector-eigenmode decomposition is verified based on multiple polarized intensity distributions. Due to the polarization-dependent VMs, the Xception model using multi-view data can provide faster training efficiency and excellent prediction accuracy. The correlation based on the DL-SPGD algorithm can be significantly increased to 99.997%, and the average correlation between the predicted and the measured VMs is 99.65%. The modal coefficients of the typical hybrid VVBs are quantitatively analyzed in the form of VM and OAM modal bases. Finally, the modal coefficients of high-order VMs using fiber MSCs demonstrate the experimental universality of the proposed MD method. This advancement not only bridges the gap between DL techniques and fiber optics but also promotes the precise characterization of VVBs in laser characterization, fiber MDM, and optical communications.
References
[17] F. Chollet. Xception: deep learning with depthwise separable convolutions. Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition, 195(2017).

Set citation alerts for the article
Please enter your email address