- Chinese Optics Letters
- Vol. 22, Issue 7, 070602 (2024)
Abstract
1. Introduction
Spatial division multiplexing (SDM) technology has been widely studied to address the capacity bottleneck faced by the fiber-optic communication system, by the feature of considering different guide modes or fiber cores as independent channels[1,2]. In SDM systems, few-mode fiber (FMF) has been intensively investigated due to its high multiplexing density and easy implementation[3]. However, various mode-dependent distortions exist in FMF, such as mode-coupling (MC) induced crosstalk, differential mode group delay (DMGD), mode-dependent loss (MDL), and mode-dependent nonlinearity, which severely degrade the signal quality and limit the system transmission performance[4,5]. In order to improve the transmission performance of the SDM system, the digital signal processing (DSP)-based compensation scheme through multi-input multi-output (MIMO) equalizer is the crucial part at the optical receiver[6,7], which could mitigate the distortions across multi-spatial-channels simultaneously. Nevertheless, because of the sensitivity of the MIMO equalizers on the channel status, the retraining operation for MIMO is required for practical SDM systems when different operational conditions are applied, e.g., launched powers, optical signal-to-noise ratios (OSNRs), time-varying channel status, etc., resulting in massive expected computation and thereby hindering the real-time application of SDM technology.
To reduce the computational complexity, the current work mainly focuses on channel optimization, which involves carefully designing the optical fibers to achieve MIMO complexity reduction. On one hand, in earlier years, people focused on DMGD management to reduce MIMO complexity. Low MIMO complexity mode-division multiplexing (MDM) transmission was demonstrated with the 10.5 km low-DMGD four-mode fiber[8]. The maximum DMGD over the C + L band was controlled below 50 ps/km. In addition, to achieve more flexible DMGD management, nearly zero-, positive-, and negative-DMGD two-mode fibers were designed and fabricated[9]. On the other hand, reducing the MIMO scale is also an effective solution. For example, by ignoring the inter-mode coupling induced crosstalk in the weak-coupling scenario, the simplified MIMO scheme is only responsible for the intra-mode equalization[10,11]. This method indeed reduces the MIMO scale, but in practical implementation, the accumulated fusion will damage the original weak coupling property of the optical fiber.
Transfer learning (TL) has become a powerful tool for fast modeling of various algorithms with the training process[12]. When new tasks appear, TL exploits the similarity in feature distributions between new and existing tasks, thereby lowering the computational cost required for model reconstruction. In recent published literatures, the TL approach is first implemented in optical networks for optical performance monitoring (OPM), such as the quality of transmission (QoT) prediction[13], OSNR monitoring for 10-span standard single-mode fiber (SSMF)[14], and fast modulation format identification (MFI)[15]. In addition, TL has also been used in nonlinearity mitigation algorithms[16-18] by transferring compensation algorithm parameters between the fiber channels with different nonlinear features caused by the transmission length, the launched power, and the OSNR. Similar to the nonlinear fiber channel, the mode-dependent channel state in more sensitive FMF channels is also affected by the above factors. In addition, the time-varying nature of FMF also constitutes a crucial factor contributing to the dynamic changes in the channel state.
Sign up for Chinese Optics Letters TOC. Get the latest issue of Chinese Optics Letters delivered right to you!Sign up now
With the increase of the spatial modes considered in the MDM system (approaching 1000 modes already discussed[19]), the massive computation demand for data recovery would be the major issue to this new multiplexing scheme in real industry implementation. It is necessary to reduce the training complexity required for frequent MIMO remodeling in FMF, the currently most sensitive fiber channel. To the best of our knowledge, there are still no works about using TL in the MIMO equalization process for MDM systems to reduce the computational complexity. Therefore, we propose the TL-aided MIMO (TL-MIMO) equalization scheme to significantly reduce the equalizers’ training complexity for the FMF-based transmission system and evaluate its compensation performance in a 3-mode point-to-point experimental system. According to the experimental test, when transferring behavior happens between the data groups collected at different time moments, corresponding to different channel states, the full TL-MIMO method reduces the training complexity by 88.46% compared to training the equalizers from scratch. Moreover, we have also performed TL-MIMO for different launched optical powers and OSNRs. Consequently, the training complexity is reduced by up to 83.33% with good stability in the measured ranges. The experimental results indicate that the reduced training complexity, in terms of training times and training data length, improves the channel tracking efficiency of the MDM system, which is expected to achieve real-time equalization even with the large MIMO scale.
2. Transfer Learning for MIMO Equalization
2.1. Transfer-learning-aided MIMO scheme
We propose the TL-MIMO scheme to reduce the training complexity in equalizer remodeling, and the schematic design is depicted in Fig. 1. The data-aided algorithm, frequency-domain least mean squares (FD-LMS), is adopted as the MIMO equalization algorithm due to its excellent stability and rapid computation speed. When employing FD-LMS, the received data of the MDM system comprises both training data and payload. The payload is recovered by the equalizer trained with training data, the optimized step size , and taps number . It is worth noting that the computation cost of MIMO equalization primarily involves training and optimizing the combinations.
Figure 1.Schematic design of the proposed TL-MIMO.
As illustrated in Fig. 1, we incorporate the MIMO training process into the TL framework. The upper part represents the source domain and the source task , where comprises a group of received data and is set to recover the payload in , requiring iterative training of with the training data from to ensure convergence. Then, the lower part of Fig. 1 contains the target domain and the target task of TL-MIMO. Various groups of received data with distinct channel states are placed in , and focuses on recovering the payload in . In the context of FD-LMS, , , and are the key parameters, determining the equalizer convergence speed and the bit error rate (BER) level[20]. In order to reduce the computation cost required to achieve , by leveraging the similarity in channel states between the received data in and , TL-MIMO transfers these three well-trained parameters from to as initial values, accelerating the equalizers’ convergence speed of .
2.2. FD-LMS algorithm in TL-MIMO
Figure 2 illustrates the structure of FD-LMS-based MIMO equalization after transferring the equalization parameters (, , ) from . As indicated by the three arrows in Fig. 2, the three parameters, respectively, affect the equalizer initialization, updating, and data-block size in MIMO equalization. The steps affected by each parameter are marked with their corresponding colors. Upon algorithm initiation, the twofold oversampled input sequences and undergo serial-to-parallel (S/P) conversion and are then divided into even and odd branches. The sequence in each branch is split into several data blocks of size , denoted by , , , and , where represents the block index. These blocks are combined into new blocks with the 50% overlap-save method and are then converted to frequency-domain with fast Fourier transform (FFT) expressed as , where i and m represent the input port index and even/odd branch, respectively.
Figure 2.2 × 2 FD-LMS-based MIMO structure combined with TL.
Because the number of equalizers corresponding to each input port is twice the number of input ports, the system with input ports has equalizers. In the MIMO module, the matrix consists of 8 FD-equalizers, namely , , , , , , , and . These FD-equalizers, denoted as , are transformed from their corresponding time-domain (TD)-equalizers , that is, , where is a zero matrix with a size of . Since the sizes of and are and , respectively, the size of should be . In TL-MIMO, the transferred from serves as the initial values of the equalizers in for subsequent training. The output frequency response is summed from the products of both branches’ data and the corresponding equalizers, as presented in Eq. (1). The output vector consists of the last data points from the inverse fast Fourier transform (IFFT) of ,
The following is the training link. We define as the nth desired output block of the ith input port. The feedback error vector is then defined as . After prefixed zeros, is transformed to frequency-domain error vector by FFT. During the nth training time, the equalizers are updated according to the gradient error estimation method, as shown in Eq. (2):
In order to save the proportion of training data in the transmission data, we consider the self-recycling training scheme to extend a string of training data several times to fully extract valuable information[21].
We quantify the algorithm complexity by the times of complex multiplication. For each training time in the FD-LMS algorithm, each undergoes complex multiplications with the input signal , the same as their length, to yield the equalized signal. In our work, we set the number of input ports as equal to the excited modes in FMF. For a system with input ports, it requires multiplications per port. Similarly, in gradient error estimation, the generation of per port involves additional complex multiplications. In addition, during the equalization, the data of each port are successively processed 8 times by FFT/IFFT, resulting in times multiplication. Therefore, for a single training time, the number of complex multiplications in FD-LMS-based MIMO equalization systems can be expressed as
Moreover, the total number of training times, , can be expressed as , where and , respectively, represent the self-recycling training times and the length of training data. Here, is defined as the training complexity. Therefore, the overall training complexity of the FD-LMS-based MIMO equalization system, denoted as , is expressed in Eq. (5). This equation indicates that, while maintaining signal quality requirements, effective reduction of training complexity can be achieved through the optimization of training data length and training times,
For the payload in , let and denote the training times and training data length needed for training from scratch to achieve a specific BER, respectively. Under the same BER requirement, the necessary training times and training data length for using TL-MIMO are denoted as and , respectively. The saved training complexity by TL-MIMO is expressed as Eq. (6). In Sec. 4, we investigate the performance of TL-MIMO by evaluating the reduction of training complexity,
3. Experimental Setup
To verify the performance of the proposed TL-MIMO scheme, we built up an MDM coherent transmission system based on a 12-km-length three-mode fiber, depicted in Fig. 3. First, the two independent electrical non-return-to-zero (NRZ) signals generated by an arbitrary waveform generator (AWG) were used to drive an optical I/Q modulator (IQM). A continuous-wave (CW) light of 1552.52 nm as the optical carrier was also injected into the IQM. The data rate of the optical QPSK was 10 Gbit/s. An erbium-doped fiber amplifier (), as well as a variable optical attenuator (), was responsible for adjusting the power level of the optical signal. In order to achieve the OSNR control in the system, an amplified-spontaneous emission (ASE) source was used to generate the noise, which coupled with the signal through the first optical coupler () with a coupling ratio of 99:1. We monitored the input OSNR through the optical spectrum analyzer (OSA) received from the 1% port of . Then, the distorted signal was divided into three branches through three 3 dB optical couplers (OCs), i.e., , , and . In each path, the VOAs were used to ensure the same input power for each branch before injecting into the mode-selective photonic lantern (). Due to the polarization sensitivity of the few-mode system[22], single-mode polarization controllers (PCs) were employed to maintain the few-mode system in a favorable polarization state and ensure similar output power for signals in each mode after demultiplexing. In order to de-correlate the three channels, the different delay was applied through three pieces of the single-mode fibers with different lengths placed at the input port of . Consequently, the data frames transmitted by the other two branches have different symbol offsets compared to that by the first branch. The mode conversion occurred at the , which also connected with the three-mode transmission fiber through its few-mode pigtail. The same three-mode graded-refractive-index fiber for the MSPL and the FMF channel was produced by YOFC[23]. The three-mode signals, , , and propagated through the 12-km-length fiber and were injected into . Then, three single-mode signals were obtained after the and detected by a coherent receiver (Co-Rx) separately. We used an optical switch (OS) to select the data each time for one channel. We calculated the relative symbol offsets among the channels based on the length differences of the delay lines placed before . The header of each frame was defined by cropping the respective symbol offset from the originally received frame of each branch. The real-time sampling was performed by an oscilloscope with a sampling rate of 50 GSa/s.
Figure 3.Coherent three-mode multiplexed experimental setup.
During the transmission in the FMF, besides the inherent distortion, such as the attenuation and the dispersion that already existed in the SSMF, the major degradation came from the mode-dependent distortions (e.g., MC and DMGD). So as to mitigate the influence of the mode-related distortions, the MIMO equalizer is an important module in DSP algorithms for MDM systems. In Fig. 3, the offline DSP processing flow is illustrated, wherein the signal successively undergoes IQ balancing, Gardner-algorithm-based time synchronization[24], Schmidl-algorithm-based frame synchronization[25], and quartic method-based frequency offset estimation[26] before being processed by the MIMO module.
4. Results and Discussion
In order to investigate the performance of the proposed TL-MIMO scheme, we have measured the experimental results in three scenarios: different time moments, different launched powers, and different OSNRs, and we investigated the complexity reduction for each case.
4.1. TL-MIMO between different time moments
Just like the conventional SSMF-based communication system, the system performance is variable due to the slowly varying factors such as the state-of-polarization (SOP) evolution, which results in the equalizer retraining at the receiver. During parallel transmission in the FMF channel, time-dependent distortions, such as crosstalk induced by MC, can result in fluctuations in transmission performance[27,28]. Therefore, in the MDM system, equalizer remodeling is an important step to adapt to the latest transmission conditions. It is necessary to investigate the features of the TL-MIMO scheme in this most practical scenario.
In this scenario, we have calculated the BER results of the data collected at different time moments, the results with retraining and two TL-MIMO methods, which are depicted in Fig. 4. For this test, the data length of each frame was 81,920, and the launched power and the input OSNR of the signals in both and were and 25 dB, respectively. First, taking a set of data as an example, Fig. 4(a) depicts the BER results with the number of self-recycling times from 0 to 40 and the fixed length of training data of 16,384. The blue line represents the process of the training from scratch. The yellow line with the legend of TL w/o (partial TL-MIMO method) means that the transferred parameters are the optimal combination of , but the values of in are initialized as zeros. The red line corresponds to the full TL-MIMO method. In this method, besides the optimal combination, the values of the well-trained in are also transferred to the target training system. In addition, since the signals of three modes are generated by the same data source, the output three-channel signals are consistent after MIMO equalization. We can find that the BER trained from scratch requires at least 2 self-recycling times to meet the 7% HD-FEC threshold of , but whether the partial or full TL-MIMO method is used, only one self-recycling time is sufficient to meet this requirement. Therefore, according to Eq. (6), TL-MIMO reduced the training times by 50% with both lower initial BER and faster convergence speed compared to training from scratch. According to Fig. 4(a), the red dotted line initially decreased to its lowest point and then started increasing again with the increase of , and the final BER was slightly higher than that of the retraining scenario. Two primary reasons account for this phenomenon: (1) During equalizer training, the overfitting of the equalizers led to degraded BER results; (2) The transferred combination is not optimal for the payload in ; thus, the BER results with TL-MIMO after sufficient training were slightly higher. As mentioned in Sec. 2, locating the optimal combination for each group of data requires multi-calls to the equalization algorithm, resulting in significant computational costs. The TL-MIMO scheme can avoid such computational costs with slight and affordable BER degradation. In addition, we have also demonstrated the constellations for the three methods with equalized BERs of around . The similar constellation results indicate that TL-MIMO has an insignificant influence on the signal features.
Figure 4.BER results of DT versus (a) Trecycling and (b) Ldata with TL-MIMO, while the data of DT and DS are collected at different time moments.
The length of training data is another determining factor of the training complexity as discussed in Section 2. The BER results with are depicted in Fig. 4(b). The measured length was from 1024 to 16384 with a step of 1024, and each point in the figure was trained with 40 self-recycling times. In comparison to the necessary 4096 training data for the retraining process to meet the 7% FEC threshold, the shortest length of training data in TL-MIMO, which is 2048, results in a 50% reduction in training complexity.
In FMF, the evolution speed of the channel state is 1 order of magnitude faster than that in SSMF[29]. To verify the time-varying characteristic of the experimental system built in Sec. 3, we demonstrate the BER distributions on the plane for the payload in and , both under the same equalization conditions, as shown in Figs. 5(a) and 5(b). The diversity in BER distribution proves the channel state differences between and .
Figure 5.BER versus μ-K combination in (a) DS and (b) DT with the data collected at different time moments.
To investigate the relationship between the BER and the parameters and both before and after applying TL-MIMO more accurately, we monitored the BER results with varying and . The results, both without and with TL-MIMO, are depicted in Figs. 6(a) and 6(b), respectively. In these figures, color highlights areas where the BER is less than . As per Eq. (5), training complexity is defined as the product of and . Comparing Figs. 6(a) and 6(b), it is evident that the colored area extends toward lower values when the TL-MIMO scheme is applied. Specifically, the lowest through training from scratch was 26,624 ( and ), marked in Fig. 6(a) with a red pentagram. However, the TL-MIMO scheme achieved a minimum of only 3072, marked in Fig. 6(b), corresponding to and . This represents an 88.46% reduction in achieved through TL-MIMO scheme.
Figure 6.BER versus Trecycling × Ldata, illustrating (a) without and (b) with the full TL-MIMO scheme.
We have also conducted a comparison between the TL-MIMO scheme and the variable step size FD-LMS (VSS-FD-LMS) algorithm[30], which is designed to accelerate equalizer convergence by optimizing the step size during training. The statistical results of 30 sets of independently received data are presented in Fig. 7, with dotted lines representing the average BER results and shadows covering the range of the tested results. In Fig. 7(a), compared to the VSS algorithm, the TL-MIMO scheme exhibits smaller initial BER values and converges to minimum values at a faster pace. Taking the average BER result as an example, the TL-MIMO scheme in Fig. 7(b) achieves a 60% reduction in compared to training from scratch and a 50% reduction compared to the VSS algorithm. These findings indicate the effectiveness of TL-MIMO scheme in reducing in MIMO equalization.
Figure 7.Statistical BER results for 30 datasets in DT comparing different MIMO schemes with respect to (a) Trecycling and (b) Ldata.
4.2. TL-MIMO between different launched powers
In this subsection, we measured the TL-MIMO results between different launched optical powers. In the experiment, the launched optical power was adjusted from to with the fixed input OSNR of 25 dB.
Figure 8 demonstrates the BER versus and in two transferring cases between different launched powers, i.e., transferring the equalization parameters from the lower launched power of to the higher launched power of (L-H case), and from to (H-L case). The results in Fig. 8 are statistically counted from 30 groups of data. For the L-H case, as depicted in Figs. 8(a) and 8(b), and were reduced by 83.33% and 60%, respectively, by the full TL-MIMO method. For the H-L case, as depicted in Figs. 8(c) and 8(d), the and the were also significantly reduced by 80% and 40%, respectively.
Figure 8.In L-H case of launched power, BER results versus (a) Trecycling and (b) Ldata; in H-L case of launched power, BER results versus (c) Trecycling and (d) Ldata for 30 groups of data in DT.
To investigate the more detailed TL-MIMO performance for the scenario of the transferring parameters from different launched powers, we have transferred the equalization parameters from well-trained equalizers with different launched powers, i.e., with different transfer powers , to other power points of the test band and depicted the results in Fig. 9. The green, red, and yellow markers, respectively, represent the results with of , , and . In the test, the 16,384 training data and 40 self-recycling times were applied to all the datasets. Through using full TL-MIMO equalization, all experimental results in the testing range of to were below the FEC threshold. The similar BER curves obtained by the three transferring cases further confirm the stability of the proposed TL-MIMO scheme. A small BER gap between the case of retraining and TL-MIMO came from the inherent optimal-value issue.
Figure 9.BER versus launched power in DT with different Pt.
4.3. TL-MIMO between different OSNRs
In fiber-optic transmission systems, the OSNR is regarded as a significant parameter to directly judge the channel quality in the optical domain. Hence, it is necessary to analyze the compatibility of the TL-MIMO scheme under different input OSNRs. In the experimental setup of Sec. 3, we could adjust the OSNR of input signals through an ASE source and a power controllable device, i.e., in the experiment. The measured OSNR was in the range of 15 to 25 dB with the launched power of . As mentioned in the previous subsection, measuring the results in two transferring cases, we transferred parameters here from the case of the input OSNR of 20 dB to the case of 25 dB (L-H case), or to the case of 15 dB (H-L case), also representing the two-way transferring behavior.
The statistical BER results for the two OSNR transferring cases are depicted in Figs. 10(a) to 10(d). The results prove that TL-MIMO has a significant improvement in reducing training complexity. For the case of transferring from the OSNR of 20 to 25 dB, 83.33% of the and 60% of the were saved by the full TL-MIMO scheme. For another case, the and the were, respectively, saved by 83.33% and 50%. Additionally, in the scenarios corresponding to Figs. 8 and 10, TL-MIMO scheme remains more efficient in our measurements compared to the VSS-FD-LMS algorithm.
Figure 10.In L-H OSNR case, BER results versus (a) Trecycling and (b) Ldata; in H-L OSNR case, BER results versus (c) Trecycling and (d) Ldata for 30 groups of data in DT.
Figure 11 demonstrates the relationship between the BER and OSNR in different cases where the parameters transferred from the OSNRs of 15, 20, and 25 dB to each OSNR-testing point (see the yellow, red, and green markers). The blue circulars are the tested results from the retraining MIMO. For each marker, it represents the average value of 30 BER data. Similar to Fig. 9, the TL-MIMO scheme for the three transferring cases shows similar results, confirming the stability of the proposed method used in the OSNR scenario. As in the other two scenarios, the small BER penalty is also observed.
Figure 11.BER versus OSNR in DT with different OSNRt.
5. Conclusion
We proposed a TL-MIMO scheme to reduce the equalizer training complexity in MDM transmission systems. Based on a 3-mode multiplexed experimental system with 12 km FMF, we carried out the investigation in three scenarios, i.e., transferring parameters between different time moments, launching optical powers, and inputting OSNRs to verify the performance of the TL-MIMO scheme. Experimental results show that under the premise of achieving the 7% HD-FEC threshold, 88.46% training complexity was reduced with the full TL-MIMO scheme in the first scenario. In the latter two scenarios, the training complexity was reduced by up to 83.33% and at least 40% according to the average results of 30 groups of data. Additionally, TL-MIMO scheme exhibited robust stability and compatibility across different launched powers and OSNR testing bands.
References
[3] G. Rademacher, R. S. Luís, B. J. Puttnam et al. 93.34 Tbit/s/mode (280 Tbit/s) transmission in a 3-mode graded-index few-mode fiber. Optical Fiber Communication Conference, W4C.3(2018).
[8] T. Mori, T. Sakamoto, M. Wada et al. Low DMD four LP mode transmission fiber for wide-band WDM-MIMO system. Optical Fiber Communication Conference/National Fiber Optic Engineers Conference, OTh3K.1(2013).
[13] W. Mo, Y. Huang, S. Zhang et al. ANN-based transfer learning for QoT prediction in real-time mixed line-rate systems. Optical Fiber Communication Conference, W4F.3(2018).
[17] W. Zhang, T. Jin, T. Xu et al. Nonlinear mitigation with TL-NN-NLC in coherent optical fiber communications. Asia Communications and Photonics Conference (ACP) and International Conference on Information Photonics and Optical Communications (IPOC), M4A.321(2020).
[19] F. M. Ferreira, F. A. Barbosa. Towards 1000-mode optical fibres(2022).
[21] T. Zhao, F. Wen, F. Li et al. Noise compensation for a nonlinear six-mode fiber channel through self-recycling training equalizer. Opto-Electronics and Communications Conference (OECC), M3F.5(2021).
[28] T. Zhao, F. Wen, K. Qiu. Spectral features with the temporal and spatial mode-coupling dynamic in a few-mode system. Photonics & Electromagnetics Research Symposium (PIERS)(2021).
[30] J. Qin, J. Ouyang. A novel variable step size LMS adaptive filtering algorithm. J. Data Acquisition Process., 12, 171(1997).
Set citation alerts for the article
Please enter your email address