• Chinese Optics Letters
  • Vol. 22, Issue 7, 070602 (2024)
Tianfeng Zhao1, Feng Wen1、*, Mingming Tan2, Baojian Wu1, Bo Xu1, and Kun Qiu1
Author Affiliations
  • 1Key Laboratory of Optical Fiber Sensing and Communication, Ministry of Education, University of Electronic Science and Technology of China, Chengdu 611731, China
  • 2Aston Institute of Photonics Technologies, Aston University, Birmingham B4 7ET, UK
  • show less
    DOI: 10.3788/COL202422.070602 Cite this Article Set citation alerts
    Tianfeng Zhao, Feng Wen, Mingming Tan, Baojian Wu, Bo Xu, Kun Qiu, "Transfer-learning multi-input multi-output equalizer for mode-division multiplexing systems," Chin. Opt. Lett. 22, 070602 (2024) Copy Citation Text show less

    Abstract

    We propose a transfer-learning multi-input multi-output (TL-MIMO) scheme to significantly reduce the required training complexity for converging the equalizers in mode-division multiplexing (MDM) systems. Based on a built three-mode (LP01, LP11a, and LP11b) multiplexed experimental system, we thoughtfully investigate the TL-MIMO performances on the three-typed data, collecting from different sampling times, launching optical powers, and inputting optical signal-to-noise ratios (OSNRs). A dramatic reduction of approximately 40%–83.33% in the required training complexity is achieved in all three scenarios. Furthermore, the good stability of TL-MIMO in both the launched powers and OSNR test bands has also been proved.

    1. Introduction

    Spatial division multiplexing (SDM) technology has been widely studied to address the capacity bottleneck faced by the fiber-optic communication system, by the feature of considering different guide modes or fiber cores as independent channels[1,2]. In SDM systems, few-mode fiber (FMF) has been intensively investigated due to its high multiplexing density and easy implementation[3]. However, various mode-dependent distortions exist in FMF, such as mode-coupling (MC) induced crosstalk, differential mode group delay (DMGD), mode-dependent loss (MDL), and mode-dependent nonlinearity, which severely degrade the signal quality and limit the system transmission performance[4,5]. In order to improve the transmission performance of the SDM system, the digital signal processing (DSP)-based compensation scheme through multi-input multi-output (MIMO) equalizer is the crucial part at the optical receiver[6,7], which could mitigate the distortions across multi-spatial-channels simultaneously. Nevertheless, because of the sensitivity of the MIMO equalizers on the channel status, the retraining operation for MIMO is required for practical SDM systems when different operational conditions are applied, e.g.,  launched powers, optical signal-to-noise ratios (OSNRs), time-varying channel status, etc., resulting in massive expected computation and thereby hindering the real-time application of SDM technology.

    To reduce the computational complexity, the current work mainly focuses on channel optimization, which involves carefully designing the optical fibers to achieve MIMO complexity reduction. On one hand, in earlier years, people focused on DMGD management to reduce MIMO complexity. Low MIMO complexity mode-division multiplexing (MDM) transmission was demonstrated with the 10.5 km low-DMGD four-mode fiber[8]. The maximum DMGD over the C + L band was controlled below 50 ps/km. In addition, to achieve more flexible DMGD management, nearly zero-, positive-, and negative-DMGD two-mode fibers were designed and fabricated[9]. On the other hand, reducing the MIMO scale is also an effective solution. For example, by ignoring the inter-mode coupling induced crosstalk in the weak-coupling scenario, the simplified MIMO scheme is only responsible for the intra-mode equalization[10,11]. This method indeed reduces the MIMO scale, but in practical implementation, the accumulated fusion will damage the original weak coupling property of the optical fiber.

    Transfer learning (TL) has become a powerful tool for fast modeling of various algorithms with the training process[12]. When new tasks appear, TL exploits the similarity in feature distributions between new and existing tasks, thereby lowering the computational cost required for model reconstruction. In recent published literatures, the TL approach is first implemented in optical networks for optical performance monitoring (OPM), such as the quality of transmission (QoT) prediction[13], OSNR monitoring for 10-span standard single-mode fiber (SSMF)[14], and fast modulation format identification (MFI)[15]. In addition, TL has also been used in nonlinearity mitigation algorithms[16-18] by transferring compensation algorithm parameters between the fiber channels with different nonlinear features caused by the transmission length, the launched power, and the OSNR. Similar to the nonlinear fiber channel, the mode-dependent channel state in more sensitive FMF channels is also affected by the above factors. In addition, the time-varying nature of FMF also constitutes a crucial factor contributing to the dynamic changes in the channel state.

    With the increase of the spatial modes considered in the MDM system (approaching 1000 modes already discussed[19]), the massive computation demand for data recovery would be the major issue to this new multiplexing scheme in real industry implementation. It is necessary to reduce the training complexity required for frequent MIMO remodeling in FMF, the currently most sensitive fiber channel. To the best of our knowledge, there are still no works about using TL in the MIMO equalization process for MDM systems to reduce the computational complexity. Therefore, we propose the TL-aided MIMO (TL-MIMO) equalization scheme to significantly reduce the equalizers’ training complexity for the FMF-based transmission system and evaluate its compensation performance in a 3-mode point-to-point experimental system. According to the experimental test, when transferring behavior happens between the data groups collected at different time moments, corresponding to different channel states, the full TL-MIMO method reduces the training complexity by 88.46% compared to training the equalizers from scratch. Moreover, we have also performed TL-MIMO for different launched optical powers and OSNRs. Consequently, the training complexity is reduced by up to 83.33% with good stability in the measured ranges. The experimental results indicate that the reduced training complexity, in terms of training times and training data length, improves the channel tracking efficiency of the MDM system, which is expected to achieve real-time equalization even with the large MIMO scale.

    2. Transfer Learning for MIMO Equalization

    2.1. Transfer-learning-aided MIMO scheme

    We propose the TL-MIMO scheme to reduce the training complexity in equalizer remodeling, and the schematic design is depicted in Fig. 1. The data-aided algorithm, frequency-domain least mean squares (FD-LMS), is adopted as the MIMO equalization algorithm due to its excellent stability and rapid computation speed. When employing FD-LMS, the received data of the MDM system comprises both training data and payload. The payload is recovered by the equalizer W trained with training data, the optimized step size μ, and taps number K. It is worth noting that the computation cost of MIMO equalization primarily involves training W and optimizing the μ-K combinations.

    Schematic design of the proposed TL-MIMO.

    Figure 1.Schematic design of the proposed TL-MIMO.

    As illustrated in Fig. 1, we incorporate the MIMO training process into the TL framework. The upper part represents the source domain DS and the source task TS, where DS comprises a group of received data and TS is set to recover the payload in DS, requiring iterative training of W with the training data from DS to ensure convergence. Then, the lower part of Fig. 1 contains the target domain DT and the target task TT of TL-MIMO. Various groups of received data with distinct channel states are placed in DT, and TT focuses on recovering the payload in DT. In the context of FD-LMS, μ, K, and W are the key parameters, determining the equalizer convergence speed and the bit error rate (BER) level[20]. In order to reduce the computation cost required to achieve TT, by leveraging the similarity in channel states between the received data in Ds and DT, TL-MIMO transfers these three well-trained parameters from Ds to DT as initial values, accelerating the equalizers’ convergence speed of DT.

    2.2. FD-LMS algorithm in TL-MIMO

    Figure 2 illustrates the structure of 2×2 FD-LMS-based MIMO equalization after transferring the equalization parameters (μ, K, W) from DS. As indicated by the three arrows in Fig. 2, the three parameters, respectively, affect the equalizer initialization, updating, and data-block size in MIMO equalization. The steps affected by each parameter are marked with their corresponding colors. Upon algorithm initiation, the twofold oversampled input sequences x1 and x2 undergo serial-to-parallel (S/P) conversion and are then divided into even and odd branches. The sequence in each branch is split into several data blocks of size K, denoted by x1e(n), x1o(n), x2e(n), and x2o(n), where n represents the block index. These blocks are combined into new blocks with the 50% overlap-save method and are then converted to frequency-domain with fast Fourier transform (FFT) expressed as Xim(n)=FFT[xim(n1),xim(n)], where i and m represent the input port index and even/odd branch, respectively.

    2 × 2 FD-LMS-based MIMO structure combined with TL.

    Figure 2.2 × 2 FD-LMS-based MIMO structure combined with TL.

    Because the number of equalizers corresponding to each input port is twice the number of input ports, the system with P input ports has 2P2 equalizers. In the 2×2 MIMO module, the W matrix consists of 8 FD-equalizers, namely W11e, W11o, W12e, W12o, W21e, W21o, W22e, and W22o. These FD-equalizers, denoted as Wijm, are transformed from their corresponding time-domain (TD)-equalizers wijm, that is, Wijm=FFT[wijm,Z(K)], where Z(K) is a zero matrix with a size of 1×K. Since the sizes of wijm and Wijm are 1×K and 1×2K, respectively, the size of W should be 2P2×2K. In TL-MIMO, the transferred W from DS serves as the initial values of the equalizers in DT for subsequent training. The output frequency response Yi(n) is summed from the products of both branches’ data and the corresponding equalizers, as presented in Eq. (1). The output vector yi(n) consists of the last K data points from the inverse fast Fourier transform (IFFT) of Yi(n), Yi(n)=j=1PXje(n)·Wije(n)+Xjo(n)·Wijo(n).

    The following is the training link. We define di(n) as the nth desired output block of the ith input port. The feedback error vector is then defined as ei(n)=di(n)yi(n). After prefixed K zeros, ei(n) is transformed to frequency-domain error vector Ei(n) by FFT. During the nth training time, the equalizers are updated according to the gradient error estimation method, as shown in Eq. (2): Wijm(n+1)=Wijm(n)+μΓim(n),where μ is the transferred step size factor and Γim(n) is calculated according to Eq. (3). τim(n) in Eq. (3) is defined as the first K data in the IFFT results of the product of Xim(n) and Ei(n), Γim(n)=FFT[τim(n),Z(K)].

    In order to save the proportion of training data in the transmission data, we consider the self-recycling training scheme to extend a string of training data several times to fully extract valuable information[21].

    We quantify the algorithm complexity by the times of complex multiplication. For each training time in the FD-LMS algorithm, each Wim(n) undergoes 2K complex multiplications with the input signal Xim(n), the same as their length, to yield the equalized signal. In our work, we set the number of input ports as equal to the excited modes in FMF. For a system with P input ports, it requires 4PK multiplications per port. Similarly, in gradient error estimation, the generation of τim(n) per port involves additional 4PK complex multiplications. In addition, during the equalization, the data of each port are successively processed 8 times by FFT/IFFT, resulting in 8Klog2(2K) times multiplication. Therefore, for a single training time, the number of complex multiplications Csingle in FD-LMS-based MIMO equalization systems can be expressed as Csingle=P×[4PK+4PK+8Klog2(2K)]=8P2K+8PKlog2(2K).

    Moreover, the total number of training times, Ttotal, can be expressed as Ttotal=Trecycling×Ldata/K=Ctrain/K, where Trecycling and Ldata, respectively, represent the self-recycling training times and the length of training data. Here, Ctrain=Trecycling×Ldata is defined as the training complexity. Therefore, the overall training complexity of the FD-LMS-based MIMO equalization system, denoted as Ctotal, is expressed in Eq. (5). This equation indicates that, while maintaining signal quality requirements, effective reduction of training complexity can be achieved through the optimization of training data length and training times, Ctotal=Ttotal×Csingle=Ctrain/K×[8P2K+8PKlog2(2K)]=Trecycling×Ldata×[8P2+8Plog2(2K)].

    For the payload in DT, let Tretrain and Lretrain denote the training times and training data length needed for training from scratch to achieve a specific BER, respectively. Under the same BER requirement, the necessary training times and training data length for using TL-MIMO are denoted as TTL and LTL, respectively. The saved training complexity Csaved by TL-MIMO is expressed as Eq. (6). In Sec. 4, we investigate the performance of TL-MIMO by evaluating the reduction of training complexity, Csaved=(Tretrain×LretrainTTL×LTL)/(Tretrain×Lretrain).

    3. Experimental Setup

    To verify the performance of the proposed TL-MIMO scheme, we built up an MDM coherent transmission system based on a 12-km-length three-mode fiber, depicted in Fig. 3. First, the two independent electrical non-return-to-zero (NRZ) signals generated by an arbitrary waveform generator (AWG) were used to drive an optical I/Q modulator (IQM). A continuous-wave (CW) light of 1552.52 nm as the optical carrier was also injected into the IQM. The data rate of the optical QPSK was 10 Gbit/s. An erbium-doped fiber amplifier (EDFA1), as well as a variable optical attenuator (VOA1), was responsible for adjusting the power level of the optical signal. In order to achieve the OSNR control in the system, an amplified-spontaneous emission (ASE) source was used to generate the noise, which coupled with the signal through the first optical coupler (OC1) with a coupling ratio of 99:1. We monitored the input OSNR through the optical spectrum analyzer (OSA) received from the 1% port of OC1. Then, the distorted signal was divided into three branches through three 3 dB optical couplers (OCs), i.e., OC2, OC3, and OC4. In each path, the VOAs were used to ensure the same input power for each branch before injecting into the mode-selective photonic lantern (MSPL1). Due to the polarization sensitivity of the few-mode system[22], single-mode polarization controllers (PCs) were employed to maintain the few-mode system in a favorable polarization state and ensure similar output power for signals in each mode after demultiplexing. In order to de-correlate the three channels, the different delay was applied through three pieces of the single-mode fibers with different lengths placed at the input port of MSPL1. Consequently, the data frames transmitted by the other two branches have different symbol offsets compared to that by the first branch. The mode conversion occurred at the MSPL1, which also connected with the three-mode transmission fiber through its few-mode pigtail. The same three-mode graded-refractive-index fiber for the MSPL and the FMF channel was produced by YOFC[23]. The three-mode signals, LP01, LP11a, and LP11b propagated through the 12-km-length fiber and were injected into MSPL2. Then, three single-mode signals were obtained after the MSPL2 and detected by a coherent receiver (Co-Rx) separately. We used an optical switch (OS) to select the data each time for one channel. We calculated the relative symbol offsets among the channels based on the length differences of the delay lines placed before MSPL1. The header of each frame was defined by cropping the respective symbol offset from the originally received frame of each branch. The real-time sampling was performed by an oscilloscope with a sampling rate of 50 GSa/s.

    Coherent three-mode multiplexed experimental setup.

    Figure 3.Coherent three-mode multiplexed experimental setup.

    During the transmission in the FMF, besides the inherent distortion, such as the attenuation and the dispersion that already existed in the SSMF, the major degradation came from the mode-dependent distortions (e.g.,  MC and DMGD). So as to mitigate the influence of the mode-related distortions, the MIMO equalizer is an important module in DSP algorithms for MDM systems. In Fig. 3, the offline DSP processing flow is illustrated, wherein the signal successively undergoes IQ balancing, Gardner-algorithm-based time synchronization[24], Schmidl-algorithm-based frame synchronization[25], and quartic method-based frequency offset estimation[26] before being processed by the MIMO module.

    4. Results and Discussion

    In order to investigate the performance of the proposed TL-MIMO scheme, we have measured the experimental results in three scenarios: different time moments, different launched powers, and different OSNRs, and we investigated the complexity reduction for each case.

    4.1. TL-MIMO between different time moments

    Just like the conventional SSMF-based communication system, the system performance is variable due to the slowly varying factors such as the state-of-polarization (SOP) evolution, which results in the equalizer retraining at the receiver. During parallel transmission in the FMF channel, time-dependent distortions, such as crosstalk induced by MC, can result in fluctuations in transmission performance[27,28]. Therefore, in the MDM system, equalizer remodeling is an important step to adapt to the latest transmission conditions. It is necessary to investigate the features of the TL-MIMO scheme in this most practical scenario.

    In this scenario, we have calculated the BER results of the data collected at different time moments, the results with retraining and two TL-MIMO methods, which are depicted in Fig. 4. For this test, the data length of each frame was 81,920, and the launched power and the input OSNR of the signals in both DS and DT were 4dBm and 25 dB, respectively. First, taking a set of data as an example, Fig. 4(a) depicts the BER results with the number of self-recycling times Trecycling from 0 to 40 and the fixed length of training data Ldata of 16,384. The blue line represents the process of the training from scratch. The yellow line with the legend of TL w/o W (partial TL-MIMO method) means that the transferred parameters are the optimal μ-K combination of DS, but the values of W in DT are initialized as zeros. The red line corresponds to the full TL-MIMO method. In this method, besides the optimal μ-K combination, the values of the well-trained W in DS are also transferred to the target training system. In addition, since the signals of three modes are generated by the same data source, the output three-channel signals are consistent after MIMO equalization. We can find that the BER trained from scratch requires at least 2 self-recycling times to meet the 7% HD-FEC threshold of 3.8×103, but whether the partial or full TL-MIMO method is used, only one self-recycling time is sufficient to meet this requirement. Therefore, according to Eq. (6), TL-MIMO reduced the training times by 50% with both lower initial BER and faster convergence speed compared to training from scratch. According to Fig. 4(a), the red dotted line initially decreased to its lowest point and then started increasing again with the increase of Trecycling, and the final BER was slightly higher than that of the retraining scenario. Two primary reasons account for this phenomenon: (1) During equalizer training, the overfitting of the equalizers led to degraded BER results; (2) The transferred μ-K combination is not optimal for the payload in DT; thus, the BER results with TL-MIMO after sufficient training were slightly higher. As mentioned in Sec. 2, locating the optimal μ-K combination for each group of data requires multi-calls to the equalization algorithm, resulting in significant computational costs. The TL-MIMO scheme can avoid such computational costs with slight and affordable BER degradation. In addition, we have also demonstrated the constellations for the three methods with equalized BERs of around 1.7×104. The similar constellation results indicate that TL-MIMO has an insignificant influence on the signal features.

    BER results of DT versus (a) Trecycling and (b) Ldata with TL-MIMO, while the data of DT and DS are collected at different time moments.

    Figure 4.BER results of DT versus (a) Trecycling and (b) Ldata with TL-MIMO, while the data of DT and DS are collected at different time moments.

    The length of training data is another determining factor of the training complexity Ctrain as discussed in Section 2. The BER results with Ldata are depicted in Fig. 4(b). The measured Ldata length was from 1024 to 16384 with a step of 1024, and each point in the figure was trained with 40 self-recycling times. In comparison to the necessary 4096 training data for the retraining process to meet the 7% FEC threshold, the shortest length of training data in TL-MIMO, which is 2048, results in a 50% reduction in training complexity.

    In FMF, the evolution speed of the channel state is 1 order of magnitude faster than that in SSMF[29]. To verify the time-varying characteristic of the experimental system built in Sec. 3, we demonstrate the BER distributions on the μ-K plane for the payload in DS and DT, both under the same equalization conditions, as shown in Figs. 5(a) and 5(b). The diversity in BER distribution proves the channel state differences between DS and DT.

    BER versus μ-K combination in (a) DS and (b) DT with the data collected at different time moments.

    Figure 5.BER versus μ-K combination in (a) DS and (b) DT with the data collected at different time moments.

    To investigate the relationship between the BER and the parameters Trecycling and Ldata both before and after applying TL-MIMO more accurately, we monitored the BER results with varying Ldata and Trecycling. The results, both without and with TL-MIMO, are depicted in Figs. 6(a) and 6(b), respectively. In these figures, color highlights areas where the BER is less than 3.8×103. As per Eq. (5), training complexity Ctrain is defined as the product of Trecycling and Ldata. Comparing Figs. 6(a) and 6(b), it is evident that the colored area extends toward lower Ctrain values when the TL-MIMO scheme is applied. Specifically, the lowest Ctrain through training from scratch was 26,624 (Trecycling=2 and Ldata=13,312), marked in Fig. 6(a) with a red pentagram. However, the TL-MIMO scheme achieved a minimum Ctrain of only 3072, marked in Fig. 6(b), corresponding to Trecycling=3 and Ldata=1024. This represents an 88.46% reduction in Ctrain achieved through TL-MIMO scheme.

    BER versus Trecycling × Ldata, illustrating (a) without and (b) with the full TL-MIMO scheme.

    Figure 6.BER versus Trecycling × Ldata, illustrating (a) without and (b) with the full TL-MIMO scheme.

    We have also conducted a comparison between the TL-MIMO scheme and the variable step size FD-LMS (VSS-FD-LMS) algorithm[30], which is designed to accelerate equalizer convergence by optimizing the step size during training. The statistical results of 30 sets of independently received data are presented in Fig. 7, with dotted lines representing the average BER results and shadows covering the range of the tested results. In Fig. 7(a), compared to the VSS algorithm, the TL-MIMO scheme exhibits smaller initial BER values and converges to minimum values at a faster pace. Taking the average BER result as an example, the TL-MIMO scheme in Fig. 7(b) achieves a 60% reduction in Ltrain compared to training from scratch and a 50% reduction compared to the VSS algorithm. These findings indicate the effectiveness of TL-MIMO scheme in reducing Ctrain in MIMO equalization.

    Statistical BER results for 30 datasets in DT comparing different MIMO schemes with respect to (a) Trecycling and (b) Ldata.

    Figure 7.Statistical BER results for 30 datasets in DT comparing different MIMO schemes with respect to (a) Trecycling and (b) Ldata.

    4.2. TL-MIMO between different launched powers

    In this subsection, we measured the TL-MIMO results between different launched optical powers. In the experiment, the launched optical power was adjusted from 15dBm to 4dBm with the fixed input OSNR of 25 dB.

    Figure 8 demonstrates the BER versus Trecycling and Ldata in two transferring cases between different launched powers, i.e., transferring the equalization parameters from the lower launched power of 10dBm to the higher launched power of 4dBm (L-H case), and from 10 to 15dBm (H-L case). The results in Fig. 8 are statistically counted from 30 groups of data. For the L-H case, as depicted in Figs. 8(a) and 8(b), Trecycling and Ldata were reduced by 83.33% and 60%, respectively, by the full TL-MIMO method. For the H-L case, as depicted in Figs. 8(c) and 8(d), the Trecycling and the Ldata were also significantly reduced by 80% and 40%, respectively.

    In L-H case of launched power, BER results versus (a) Trecycling and (b) Ldata; in H-L case of launched power, BER results versus (c) Trecycling and (d) Ldata for 30 groups of data in DT.

    Figure 8.In L-H case of launched power, BER results versus (a) Trecycling and (b) Ldata; in H-L case of launched power, BER results versus (c) Trecycling and (d) Ldata for 30 groups of data in DT.

    To investigate the more detailed TL-MIMO performance for the scenario of the transferring parameters from different launched powers, we have transferred the equalization parameters from well-trained equalizers with different launched powers, i.e., with different transfer powers Pt, to other power points of the test band and depicted the results in Fig. 9. The green, red, and yellow markers, respectively, represent the results with Pt of 15, 10, and 4dBm. In the test, the 16,384 training data and 40 self-recycling times were applied to all the datasets. Through using full TL-MIMO equalization, all experimental results in the testing range of 15 to 4dBm were below the FEC threshold. The similar BER curves obtained by the three transferring cases further confirm the stability of the proposed TL-MIMO scheme. A small BER gap between the case of retraining and TL-MIMO came from the inherent optimal-value issue.

    BER versus launched power in DT with different Pt.

    Figure 9.BER versus launched power in DT with different Pt.

    4.3. TL-MIMO between different OSNRs

    In fiber-optic transmission systems, the OSNR is regarded as a significant parameter to directly judge the channel quality in the optical domain. Hence, it is necessary to analyze the compatibility of the TL-MIMO scheme under different input OSNRs. In the experimental setup of Sec. 3, we could adjust the OSNR of input signals through an ASE source and a power controllable device, i.e., VOA1 in the experiment. The measured OSNR was in the range of 15 to 25 dB with the launched power of 4dBm. As mentioned in the previous subsection, measuring the results in two transferring cases, we transferred parameters here from the case of the input OSNR of 20 dB to the case of 25 dB (L-H case), or to the case of 15 dB (H-L case), also representing the two-way transferring behavior.

    The statistical BER results for the two OSNR transferring cases are depicted in Figs. 10(a) to 10(d). The results prove that TL-MIMO has a significant improvement in reducing training complexity. For the case of transferring from the OSNR of 20 to 25 dB, 83.33% of the Trecycling and 60% of the Ldata were saved by the full TL-MIMO scheme. For another case, the Trecycling and the Ldata were, respectively, saved by 83.33% and 50%. Additionally, in the scenarios corresponding to Figs. 8 and 10, TL-MIMO scheme remains more efficient in our measurements compared to the VSS-FD-LMS algorithm.

    In L-H OSNR case, BER results versus (a) Trecycling and (b) Ldata; in H-L OSNR case, BER results versus (c) Trecycling and (d) Ldata for 30 groups of data in DT.

    Figure 10.In L-H OSNR case, BER results versus (a) Trecycling and (b) Ldata; in H-L OSNR case, BER results versus (c) Trecycling and (d) Ldata for 30 groups of data in DT.

    Figure 11 demonstrates the relationship between the BER and OSNR in different cases where the parameters transferred from the OSNRs of 15, 20, and 25 dB to each OSNR-testing point (see the yellow, red, and green markers). The blue circulars are the tested results from the retraining MIMO. For each marker, it represents the average value of 30 BER data. Similar to Fig. 9, the TL-MIMO scheme for the three transferring cases shows similar results, confirming the stability of the proposed method used in the OSNR scenario. As in the other two scenarios, the small BER penalty is also observed.

    BER versus OSNR in DT with different OSNRt.

    Figure 11.BER versus OSNR in DT with different OSNRt.

    5. Conclusion

    We proposed a TL-MIMO scheme to reduce the equalizer training complexity in MDM transmission systems. Based on a 3-mode multiplexed experimental system with 12 km FMF, we carried out the investigation in three scenarios, i.e., transferring parameters between different time moments, launching optical powers, and inputting OSNRs to verify the performance of the TL-MIMO scheme. Experimental results show that under the premise of achieving the 7% HD-FEC threshold, 88.46% training complexity was reduced with the full TL-MIMO scheme in the first scenario. In the latter two scenarios, the training complexity was reduced by up to 83.33% and at least 40% according to the average results of 30 groups of data. Additionally, TL-MIMO scheme exhibited robust stability and compatibility across different launched powers and OSNR testing bands.

    References

    [1] J. Du, W. Shen, J. Liu et al. Mode division multiplexing: from photonic integration to optical fiber transmission. Chin. Opt. Lett., 19, 091301(2021).

    [2] B. J. Puttnam, G. Rademacher, R. S. Luís. Space-division multiplexing for optical fiber communications. Optica, 8, 1186(2021).

    [3] G. Rademacher, R. S. Luís, B. J. Puttnam et al. 93.34 Tbit/s/mode (280 Tbit/s) transmission in a 3-mode graded-index few-mode fiber. Optical Fiber Communication Conference, W4C.3(2018).

    [4] K. P. Ho, J. M. Kahn. Linear propagation effects in mode-division multiplexing systems. J. Lightwave Technol., 32, 614(2014).

    [5] F. M. Ferreira, C. S. Costa, S. Sygletos et al. Semi-analytical modelling of linear mode coupling in few-mode fibers. J. Lightwave Technol., 35, 4011(2017).

    [6] P. J. Winzer, G. J. Foschini. MIMO capacities and outage probabilities in spatially multiplexed optical transport systems. Opt. Express, 19, 16680(2011).

    [7] K. Shibahara, T. Mizuno, D. Lee et al. Advanced MIMO signal processing techniques enabling long-haul dense SDM transmissions. J. Lightwave Technol., 36, 336(2018).

    [8] T. Mori, T. Sakamoto, M. Wada et al. Low DMD four LP mode transmission fiber for wide-band WDM-MIMO system. Optical Fiber Communication Conference/National Fiber Optic Engineers Conference, OTh3K.1(2013).

    [9] R. Maruyama, N. Kuwaki, S. Matsuo et al. Two mode optical fibers with low and flattened differential modal delay suitable for WDM-MIMO combined system. Opt. Express, 22, 14311(2014).

    [10] D. Soma, S. Beppu, Y. Wakayama et al. 257-Tbit/s weakly coupled 10-mode C+L-band WDM transmission. J. Lightwave Technol., 36, 1375(2018).

    [11] S. Beppu, D. Soma, S. Sumita et al. 402.7-Tb/s MDM-WDM transmission over weakly coupled 10-mode fiber using rate-adaptive PS-16QAM signals. J. Lightwave Technol., 38, 2835(2020).

    [12] S. J. Pan, Q. Yang. A survey on transfer learning. IEEE Trans. Knowl. Data Eng., 22, 1345(2010).

    [13] W. Mo, Y. Huang, S. Zhang et al. ANN-based transfer learning for QoT prediction in real-time mixed line-rate systems. Optical Fiber Communication Conference, W4F.3(2018).

    [14] L. Xia, J. Zhang, S. Hu et al. Transfer learning assisted deep neural network for OSNR estimation. Opt. Express, 27, 19398(2019).

    [15] Y. Cheng, W. Zhang, S. Fu et al. Transfer learning simplified multi-task deep neural network for PDM-64QAM optical performance monitoring. Opt. Express, 28, 7607(2020).

    [16] J. Zhang, L. Xia, M. Zhu et al. Fast remodeling for nonlinear distortion mitigation based on transfer learning. Opt. Lett., 44, 4243(2019).

    [17] W. Zhang, T. Jin, T. Xu et al. Nonlinear mitigation with TL-NN-NLC in coherent optical fiber communications. Asia Communications and Photonics Conference (ACP) and International Conference on Information Photonics and Optical Communications (IPOC), M4A.321(2020).

    [18] P. J. Freire, D. Abode, J. E. Prilepsky et al. Transfer learning for neural networks-based equalizers in coherent optical systems. J. Lightwave Technol., 39, 6733(2021).

    [19] F. M. Ferreira, F. A. Barbosa. Towards 1000-mode optical fibres(2022).

    [20] B. F. Boroujeny, K. S. Chan. Analysis of the frequency-domain block LMS algorithm. IEEE Trans. Signal Process., 48, 2332(2000).

    [21] T. Zhao, F. Wen, F. Li et al. Noise compensation for a nonlinear six-mode fiber channel through self-recycling training equalizer. Opto-Electronics and Communications Conference (OECC), M3F.5(2021).

    [22] M. Arikawa, T. Ito. Performance of mode diversity reception of a polarization-division-multiplexed signal for free-space optical communication under atmospheric turbulence. Opt. Express, 26, 28263(2018).

    [23]

    [24] F. Gardner. A BPSK/QPSK timing-error detector for sampled receivers. IEEE Trans. Commun., 34, 423(1986).

    [25] T. M. Schmidl, D. C. Cox. Robust frequency and timing synchronization for OFDM. IEEE Trans. Commun., 45, 1613(1997).

    [26] A. Leven, N. Kaneda, U.-V. Koc et al. Frequency estimation in intradyne reception. IEEE Photon. Technol. Lett., 19, 366(2007).

    [27] M. Nakazawa, M. Yoshida, T. Hirooka. Measurement of mode coupling distribution along a few-mode fiber using a synchronous multi-channel OTDR. Opt. Express, 22, 31299(2014).

    [28] T. Zhao, F. Wen, K. Qiu. Spectral features with the temporal and spatial mode-coupling dynamic in a few-mode system. Photonics & Electromagnetics Research Symposium (PIERS)(2021).

    [29] X. Chen, J. He, A. Li et al. Characterization and analysis of few-mode fiber channel dynamics. IEEE Photon. Technol. Lett., 25, 1819(2013).

    [30] J. Qin, J. Ouyang. A novel variable step size LMS adaptive filtering algorithm. J. Data Acquisition Process., 12, 171(1997).

    Tianfeng Zhao, Feng Wen, Mingming Tan, Baojian Wu, Bo Xu, Kun Qiu, "Transfer-learning multi-input multi-output equalizer for mode-division multiplexing systems," Chin. Opt. Lett. 22, 070602 (2024)
    Download Citation