
- SJ_Zhang
- Jun. 15, 2025
Abstract
It is highly desirable that artificial circularly polarized phosphorescent materials with high luminescence asymmetry factor (glum), narrowband emission and tunable chiral phosphorescent performance can be constructed. Especially, precise control and simultaneous independent switching of circularly polarized fluorescent and phosphorescent performance for the same molecules remain a formidable challenge. Herein, we propose a strategy to customized design of circularly polarized phosphorescent materials based on large language models and transfer learning methods, which not only enables efficient identification of suitable synthesis precursors, but also provides valuable guidance for experimental procedures. We demonstrate the significant advantages of transfer learning with limited chemical data, and precisely fabricate films with high glum (1.86), narrow full-width at half-maximum (49 nm) and customized circularly polarized phosphorescent performance with targeted spectral position. The inverse customization of materials with user-specified circularly polarized fluorescent/phosphorescent performance can be achieved, favoring their application in multicolor display and multidimensional information encryption.
Introduction
In recent years, circularly polarized phosphorescence (CPP) materials have garnered significant interest due to their unique properties including intrinsic chiroptical performance1, long lifetime, large Stokes shift, and various potential applications in the field of 3D displays, anticounterfeiting and so on2,3,4,5,6. To date, a variety of CPP materials such as chiral inorganic materials7, organic small molecules8,9,10,11, self-assembling and polymer systems12,13,14 have been developed and successfully employed in various chiroptical applications including sensing, displays, imaging and information encryption15,16,17. However, in most cases, the luminescence asymmetry factor (glum) of the reported CPP materials fall within the range of 10−3 to 10−2, limiting their practical applications. To further amplify glum value, Prof. Deng et al. successfully doped chiral fluorescent helical polymer with chiral nematic liquid crystals to realize high glum of −1.8718. Similarly, Zhao and his colleagues developed a long-lived room-temperature phosphorescent system with a high glum of 1.49 based on polymer-stabilized cholesteric liquid crystals19. Extensive studies in this field are mainly focused on combining dye components with synthesis of chiral crystals or liquid crystal systems20,21, but few reports have been described to achieve outstanding CPP performance in metal-free amorphous films22. Compared with the well-established technique using chiral molecules or assemblies23,24, twisted stacking of multiple achiral anisotropic functional layers has become a popular method for fabricating chiral nanostructures with high glum values25,26,27,28,which offers more freedom to regulate the chiroptical activity. In the previous studies, our group constructed a series of circularly polarized luminescence materials, in which glum can reach up to 1.929,30. Different from fluorescent materials, phosphorescent molecules possess excellent photophysical behavior stemming from the transition from the triplet excitons to the ground state31,32,33. Therefore, combining circularly polarized fluorescent (CPL) and CPP properties into the same material and realize simultaneous modulation of CPP and CPL (with same or opposite handedness at targeted spectral position) is meaningful, as it can not only expand the freedom of chiral regulation but also integrate new functionalities for applications. However, most reported phosphorescent materials exhibit broad full-width at half-maximum (FWHM) and poor color purity. Additionally, due to the richness in design space for such twist-stacking structures, traditional experimental approaches based on trial-and-error are time-consuming and inefficient. Even in the simplest case, all design parameters must be simultaneously optimized, and the search-and-optimization process has to be repeated from beginning to end for every target chiroptical property (such as a certain glum value at a specific wavelength with a suitable FWHM). Meanwhile, the discovery process of synthesis precursors relies on researchers gathering information from the literature, which significantly slows down the development efficiency of new materials. Therefore, on-demand designing CPP materials with high glum value (>1), narrowband emission and customized CPP performance, especially from achiral amorphous films, remains a significant challenge.
Nowadays, data-driven machine learning (ML) techniques are rapidly gaining significant attention as a powerful and flexible approach to addressing the aforementioned issues. In previous studies, Professor Feng34 and our group29 demonstrated ML-based techniques to guide the synthesis of CPL materials with high glum values and multiple chiral regulation strategies. Typically, an ML model requires training on a substantial amount of data to effectively predict complex structure-property relationships; however, in many instances, the scarcity of high-quality data limits its application35,36,37. To tackle this challenge, transfer learning receives growing emphasis38,39. By pre-training models on similar tasks with larger datasets and subsequently fine-tuning them for target tasks, transfer learning effectively decreases the need for extensive data40,41. Yamada et al. developed a pre-trained model library that enables high-performance predictions of chemical properties under conditions of limited data42. In addition, the powerful text comprehension capabilities of generative language models position them as promising tools for chemical text mining43. With the availability of large-scale materials synthesis literature, large language models (LLMs) can now be used to recommend synthesis precursors for new target materials44,45. It is worth noting that combining LLMs with transfer learning methods can provide valuable guidance for the selection of suitable phosphorescent molecules and the construction of the chiral layer. The on-demand inverse design of CPP films with high glum values, narrowband emission and customized CPP performance guided by this integrated approach with limited chemical data, has not yet been described to date.
In this work, a strategy is utilized to guide the on-demand customization of CPP films based on LLMs and transfer learning (as shown in Fig. 1). Firstly, an LLM is used to search for phosphorescent molecules for the preparation of CPP materials (Fig. 1a). More than 500 articles on phosphorescent materials are processed by an embedding model to build a vector database of literature texts, which is later referenced by an LLM via the Retrieval Augmented Generation (RAG) method to generate a summary of molecules and properties of each article using prompt words, the LLM is used to summarize and screen the text to generate a list of molecules that met the requirements. Here, three phosphorescent molecules are selected manually from the list. Secondly, to build the relationship between structure and function of the CPP materials, machine learning has been introduced (Fig. 1b). Given the complexity of the twisted-stacking structure and the richness in design space, constructing a relationship between structure/process parameters and target CPP performance poses a challenge. Therefore, transfer learning is utilized to construct the structure-function relationship, which facilitates accurate forward prediction of CPP performance from experimental phosphorescent spectra and structure/process parameters. Finally, inverse design and customized manufacturing of CPP films with target functionality can be achieved (Fig. 1c). We demonstrate the significant advantages of transfer learning with limited chemical data, which can guide the precise fabrication of chiral phosphorescent films with high glum up to 1.86, narrow FWHM of 49 nm and customized CPP performance with the targeted spectral position across the full visible spectrum. Moreover, the proposed inverse design enables flexible switching of phosphorescent color with high purity of 95% for multicolor display and customization of both CPL and CPP performance (with the same or opposite handedness at targeted spectral position) from the same phosphorescent molecules, which provides a dimension for information encryption. It is anticipated that these findings not only provide the customization design of CPP materials but also are of great fundamental value for underscoring the promise of ML-driven design approaches in the development of functional materials.
Fig. 1: The on-demand customization of CPP films with target functionality guided by LLMs and transfer learning.
a Recommendation of target materials with LLMs. b Construction of the structure-spectrum-function relationship based on transfer learning. c Inverse design and customized manufacturing of CPP films with target functionality.
Results
LLMs for precursor selection and CPP performance regulation
In the preparation of CPP materials with high glum values, high quantum efficiency and long lifetime, phosphorescent molecule selection plays a crucial role in governing the preparation pathway. To extract the complex and diverse information embedded in chemical literature, LLM was used to choose suitable phosphorescent molecules (Fig. 2a). We collected over 500 documents on the design and discovery of phosphorescent materials, downloaded them as PDF files and processed them into vector databases using LangChain. Utilizing RAG technology, we summarized the textual information and compiled these summaries into a database. This approach processes scientific articles in their entirety. It creates vector embeddings of textual content (paragraphs), then retrieves relevant information based on the semantic similarity of the text, and augments the LLM’s generation with this contextually relevant information. Furthermore, we enabled Long Context mode in LLMs to maintain a detailed record of extended textual content, thus facilitating the processing and generation of complex data sets. Our system processes chemical information as it naturally appears within scientific literature. The LLM processes this information contextually, including any SMILES strings, chemical formulas, or descriptive text about molecular properties that might be present in the retrieved passages. This approach allows for a more natural and comprehensive handling of chemical information within the framework of scientific knowledge (Supplementary Fig. 1). To focus our study, we applied constraints such as room-temperature, long lifetime, and high quantum efficiency, leading to a list of phosphorescent molecules generated by the LLM (Supplementary Table 1). From this list, we manually selected three molecules: 1-pyrenylboronic acid (PA), 7H-dibenzo[c,g]-carbazole (DBCz), and 11,12-dihydroindolo[2,3-a]carbazole (ICz). These molecules all exhibited similar blue fluorescence emissions, but different phosphorescence emissions, including blue for ICz, green for DBCz and red for PA, when they were compounded with polyvinyl alcohol (PVA), respectively (Fig. 2a and Supplementary Figs. 2–4). All chiroptical films were prepared by overlaying one transparent oriented PVA layer (as a phase retarder) onto another highly oriented dyed PVA layer (as a polarizer with selective absorption) in a twisted fashion to fabricate a twisted stacking structure30. Large optical activities can be observed when the twist angle is ±45° (Supplementary Fig. 5), in accordance with the prediction based on Jones Matrix mechanism46. There are several variable parameters, including the thickness and stretching degree of the transparent oriented PVA layer, greyscale and dye molecules selection of highly oriented dyed films as well as the twist angle, which greatly affect their optical activities (Supplementary Figs. 6–9). Then, CPP materials can be obtained by integrating isotropic phosphorescent molecules/PVA hybrid films with above twisted stacking structure films (Fig. 2b). Their CPL and CPP performance could be defined by asymmetry factor glum (glum = 2(IL-IR)/(IL + IR)), where IL and IR were the emissions of left-handed and right-handed circularly polarized light. The generation of CPL and CPP could be understood based on a Muller matrix for a combination of polarizer and phase retarder25. The linearly polarized fluorescence (phosphorescence) would be produced after passing through the highly oriented dyed PVA layer (as the polarizer), and then CPP would be produced after passing through the transparent oriented PVA layer (as a phase retarder) in a twisted fashion (Fig. 2b). In total more than 108 possible combinations are estimated to achieve different CPP performance. Flexibly manipulating the CPP performance in a simple way always remains challenging15. By altering the above variable structure/processing parameters and their combinations, a continuous modulation of CPP performance including the sign, magnitude, peak position as well as FWHM could be achieved (Fig. 2c–e and Supplementary Figs. 10–12).
Fig. 2: LLMs recommendation and CPP performance of the hybrid films.
a Flowchart of molecular recommendation by LLMs. b Schematic illustration of the generation of CPP through the twisted stacking structure. CPP performance of the samples with (c) different glum values at around 650 nm by adjusting the thickness and strain of transparent oriented PVA layer. d Different peak position but similar glum values by varying the strain of transparent oriented PVA layer and the dye molecules. e Different FWHM by varying the combinations of the thickness and strain of transparent oriented PVA layer as well as the dye molecules.
Transfer learning model for forward prediction
Due to the richness of design space, precise forward prediction of the CPP performance of the hybrid films based on traditional trial-and-error approaches seemed unlikely, since these variable parameters often correlated with each other. In our previous work, we established a quantitative structure-spectrum-function relationship between the structural parameters, spectral features and circular dichroism (CD) properties of the twisted stacking structure, and realized precise forward prediction based on machine learning techniques29. Herein, the CD dataset of the twisted-stacking structures in our previous work was recruited to solve this problem. Predicting the CD spectra was adapted as a pre-training task for assisting the prediction of glum for CPP materials. To further enhance the accuracy of the forward prediction model, 135 combinations of parameters were employed to construct CPP hybrid films, and their CPL as well as CPP performance was measured. The fluorescence spectrometer was used to measure the emission spectra of the composite film in the presence of left-handed and right-handed circular polarizers. Then, according to the formula CPL = IL − IR, the circular polarized luminescent properties of the composite film were obtained. Furthermore, based on the formula glum = 2(IL-IR)/(IL + IR), the spectrum of glum varying with the wavelength was calculated, which was used as the data source for model training (Supplementary Figs. 13–15). This dataset was used for both training and validation of the forward prediction model. To efficiently simulate the glum spectra properties of the CPP materials, a forward prediction model based on transfer learning has been developed. Given the spectroscopic properties of glum, the forward prediction neural network was modeled as a multivariate regression problem. As shown in Fig. 3a, the model input consisted of two components: one was the spectral embedding descriptor, which included various structural and process parameters such as thickness, the stretching degree, grayscale, twist angle, and dye absorption. The composition of the spectral embedding descriptor vector is detailed in Supplementary Table 2. The other input is the emission spectrum of the phosphorescent molecule, exhibiting a wavelength range spanning from 400 to 700 nm and a data interval set at 0.2 nm. To illustrate the validity of the selected structural and process parameters, we analyzed the correlation among the four features using a correlation coefficient heatmap. The four elements in Fig. 3b represent the types of polarization layer dyes and phosphorescent molecules, the thickness of the phase delay layer, and the stretching degree of the phase delay layer, respectively. While the non-primary diagonal elements in the figure are close to 0, indicating that there is no linear correlation between the variables A, B, C, and D. The results indicated that these variables were completely independent of each other, which further validated the balance and effectiveness of the dataset (Fig. 3b). During the pre-training phase, an encoder-decoder architectural framework was utilized to map the structural and process parameters of the film into an embedding space, subsequent to which the embedded information was decoded for the generation of CD spectra. The latent representations in the embedding space were extracted, implicitly containing information on the absorption properties of chiral films, facilitating knowledge transfer and supporting the subsequent integrated modeling of glum spectra. In the fine-tuning phase, the model used the pre-trained encoder to generate latent features. These latent features were concatenated with the phosphorescent molecular emission spectra and fed into a Multi-Layer Perceptron (MLP), which output a reference vector x of the same dimension as the glum spectrum. Considering the different modulation mechanisms of process/structural parameters on the glum spectrum, a parameter encoder was also designed to encode the process/structural parameters into offset factor β and weight factor α. The implementation details of the parameter encoder were introduced in the Method-Model Architecture section. Using these two factors, the reference vector x was further adjusted to generate the final prediction result λ, as expressed in the following equation:
We established a model evaluation scheme with 10-fold cross-validation and employed five evaluation metrics to assess the model performance, including mean square error (MSE), mean absolute error (MAE), root mean square error (RMSE), coefficient of determination (R2) and Pearson correlation coefficient (Pearson). For different models, higher MSE, MAE, and RMSE values indicate worse model performance, while higher R² and Pearson values reflect better performance. We comparatively analyzed our model with several other machine learning regression models including Random Forest (RF), Support Vector Regression (SVR), and XGBoost algorithms. Our model demonstrated the best predictive performance, with the lowest MAE of 0.14 and the highest R² of 0.89. Furthermore, our model exhibited a relatively lower standard deviation in the predicted results, indicating its superior consistency and stability (Fig. 3c and Supplementary Table 3). Figure 3d visualizes the glum predictions of a test sample against experimental fittings, showcasing the model’s prediction capability. The predicted curve closely aligns with the experimental curve, achieving an excellent R² of 0.993. The points in the scatter plot predominantly follow the line of y = x, suggesting a strong consistency between the predicted and experimental values. Additionally, to evaluate the effectiveness of the parameter encoder and transfer learning, ablation experiments were designed to compare performance differences with the absence of various components. Compared to models missing the offset factor, weight factor, or both, the original complete parameter encoding resulted in better predictive performance, indicating that both encoding factors contribute to more effective learning of the structure-spectrum-function relationship (Fig. 3e and Supplementary Table 4). To demonstrate the necessity of adopting transfer learning, we compared our model with the model without transfer learning. The result showed our approach significantly improved prediction accuracy. The MAE decreased from 0.24 to 0.14 (Fig. 3f and Supplementary Table 5), demonstrating that the material latent representations extracted via transfer learning effectively capture the absorption characteristics of the film materials. In order to further illustrate the predicted effect of our model and its practical application value, we also explored the model’s zero-shot regression capability. We prepared CPP materials in which methyl orange was used for the dye layer in the chiral film and our model had not been trained on experimental data for this dye. The robustness and generalization potential of the method was demonstrated by comparing the curves given by the model with the curves obtained from the experiments (Fig. 3g). As a polarized dye present in a pre-trained database, it is possible to predict the glum spectrum of the CPP material made from this dye, even if the model has not been trained on related data.
Fig. 3: Transfer learning framework and forward prediction results.
a The framework of transfer learning. b Correlation heat map of the features. c Mean square error, mean absolute error, root mean square error, coefficient of determination and Pearson correlation coefficient histogram of the models. d Fitting curves of experimental and predicted results. e The curve of MSE, MAE, RMSE, R2 and Pearson correlation coefficient for models with different encoding factors. f Histogram of mean square error, mean absolute error, root mean square error, coefficient of determination and Pearson correlation coefficient for models with and without pretraining. g Zero-shot generalization of forward prediction model.
Inverse design of CPP materials
The established structure-spectrum-function relationship of CPP materials based on transfer learning can be further exploited in an inverse model for personalized customization of CPP materials with target functionality. Utilizing a forward prediction model, a virtual database was established, containing various predefined structural parameter combinations and their corresponding glum spectra, generated by the forward model. Therefore, the predefined structural parameter combinations expanded the existing dataset of 135 samples, which includes all possible combinations of 41 different stretching degrees (ranging from 80% to 120%) and other parameters within original ranges, totaling 1,107 data points. Researchers can formulate structural expert rules based on specific experimental scenarios and retrieve data from the virtual database according to these rules. After comparison and selection, the system will generate all possible structural parameter combinations that meet the required criteria, and the inverse design system was established, as outlined in Fig. 4a. The target CPP performance is user-defined and can take any form expressible from the CPP spectral features, e.g., a certain glum value (1.5) at a certain wavelength (450 nm, 550 nm, and 650 nm, respectively). Upon inputting the target glum value and corresponding wavelength into the inverse design system, a set of structure/process parameters combinations and phosphorescent molecule selections was generated, as shown in Fig. 4b. There are typically many available answers for a given target, since the hybrid films with different phosphorescent molecules and various structure/process parameters combinations would exhibit the same glum value at a certain wavelength. To evaluate the accuracy of the inverse design system, one of the possible parameters combinations was chosen to experimentally construct the hybrid film. As shown in Fig. 4c and 4d, the experimentally measured glum spectra matched well with the predicted spectra and most of the data points in the scatter plot align closely with the y = x line, with an R² value of 0.997, indicating its excellent accuracy. Figure 4e contains a histogram for the inversely designed and as-prepared films that can generate glum value of 1.5 at 450 nm, 550 nm, and 650 nm, respectively. The output of the inverse design system always consistently satisfies the specified design requirements (Supplementary Fig. 16), with errors maintained below 10%. Moreover, when a series of glum values (0.5, 0.9, 1.3, 1.5, and 1.7) at a certain wavelength (e.g. around 550 nm) were needed, the inverse design system can successfully report a set of possible selections of phosphorescent molecules and structure/process parameters combinations. Compared with the predicted spectra, the experimentally measured glum spectra exhibited excellent accuracy (Fig. 4f and Supplementary Fig. 17), confirming the success of our inverse design system for CPP films with target glum value at a certain wavelength.
Fig. 4: Reverse model for inverse design of CPP materials.
a Schematic diagram for the inverse design. b Parameters generated by inverse design. c Fitting curves and d scatter plots of experimental and predicted results. Actual values and expected values histograms for the samples of experimental realization using inverse design for the targets: e glum = 1.5 at 450 nm, 550 nm, 650 nm; and f glum = 0.5, 0.9, 1.3, 1.5, 1.7 at around 550 nm, respectively.
Inverse design of multicolor display
The successful achievement of the inverse design system in precise prediction of single glum values at a single wavelength for CPP materials inspired us to extend our method to multiple glum values at multiple wavelengths, to meet the demand of multicolor display. As illustrated in Fig. 5a, the resultant CPP color depended not only on the selection of phosphorescent molecules but also on the constitution of the hybrid films. Taking DBCz as an example, in the absence of the chiral layer, the DBCz/PVA hybrid films exhibited green broad bandwidth phosphorescent emission, as depicted in Fig. 5b and 5c. Overlaying a transparent oriented PVA layer (the stretching degree of 90% as a phase retarder) onto another highly oriented Congo Red dyed PVA layer (as a polarizer) was utilized to fabricate a twisted stacking chiral layer. It should be noted here that the formed chiral layer exhibited giant optical activity and frequency selective transmission, which can be exploited as a circular polarization-based filter (CPF) for multiplex color switching. As expected, two different phosphorescent spectra could be obtained through L- or R-CPF, respectively (Fig. 5b). Thus, a green-cyan color switching in phosphorescent emission could be observed when viewed through L-CPF. Similarly, green-reddish orange color switching in phosphorescent emission could be observed when viewed through R-CPF (Fig. 5c, chiral layer 1). As mentioned above, the effective phosphorescent color switching greatly depended on their giant optical activity and frequency selective absorption properties of the chiral layer. Interestingly, by only varying the stretching degree of the transparent PVA layer (e.g. 120%), the chiral layer exhibited corresponding red shift in CD characterizations (Supplementary Fig. 18). Therefore, different color switching in phosphorescent emission could be observed when viewed through L- or R-CPF, respectively (Fig. 5c chiral layer 2, and Supplementary Fig. 19). Notably, only increasing the stretching degree of the PVA layer induces an overall red shift in the glum spectrum (Fig. 5d and Supplementary Fig. 20), greatly enhancing its effective color switching range in phosphorescent emission. Each of the three different constitutions of the hybrid films (including ICz, DBCz and PA) could work as a phosphorescent emission switch among three prime colors. Together, these three systems constitute a circular polarization-based full-color phosphorescent display. Any color within the enclosed area in the CIE1931 chromaticity diagram could be achieved and further switched based on above inverse design model for selecting suitable phosphorescent molecules and structure/process parameters combinations. This color space covers ~145% of a typical sRGB color gamut found in commercial color displays (Fig. 5e). For comparison, the color gamut of our previous system without the help of inverse design only reached 67% of the sRGB. Furthermore, our method effectively narrows the full-width at half maximum (FWHM) of the emission spectra of the phosphorescent materials, which can be utilized to enhance the emission color purity. For instance, for DBCz/PVA, by selecting a chiral layer with Congo Red dye, thickness of 80 μm, and stretching degree of 110%, the color purity was increased to about 95%, with the corresponding CIE coordinates being (0.480, 0.503) (Supplementary Fig. 21).
Fig. 5: CPP materials towards circular polarization-based multicolor display.
a Schematic diagram for the CPP materials with controllable color switching performance in phosphorescent emission. b The phosphorescence (Ph), CD and circularly polarized phosphorescent (CPP) spectra of the DBCz/PVA hybrid chiral films. c Different color switching performance could be achieved by only varying the stretching degree of PVA layers (90% and 120%, respectively). d The influence of stretching on the glum value of the hybrid films. The data is sourced from a virtual database, which was generated under the following conditions: the phosphorescent molecule is DBCz, the dye of the polarization layer is Direct Blue 71, the thickness of the phase retardation layer is 48 μm, and the stretching degree of the phase retardation layer is varied from 80% to 120%. The interval of the stretching degree between each group of data is 1. e Color gamut of the CPP materials in CIE1931 chromaticity diagram. A typical sRGB color gamut is shown for comparison.
Inverse design of 4D encrypting information
It would be highly desirable to design and controllably synthesize CPP materials with programmable CPL and CPP performance, e.g. with the same or opposite handedness, which could serve as unique photonic components in chiral optoelectronics and nanophotonics. However, the customized manufacturing of CPP materials with tailorable CPL and CPP properties would be very difficult based on traditional techniques such as the utilization of chiral dopants or chiral liquid crystals. But the success of inverse design for the chiral layer with multiple gabs values at multiple wavelengths allows us to easily customize programmable CPL and CPP performance from the same phosphorescent layer. As shown in Fig. 6a, the target function was user defined (CPL and CPP performance with the same or opposite handedness) and was input into the computational brain, which could be processed with the inverse design model and a set of structure/processing parameters that approaches the target function were reported. For example, we defined left-handed blue CPL but right-handed green CPP as the target function, which was input into the inverse design model, and the suitable structure/processing parameters were given (Supplementary Fig. 22). As expected, only blue CPL “中” image could be viewed through L-CPF, while green CPP “中” image could be viewed through R-CPF, indicating that the produced sample exhibited the customized CPL and CPP performance with opposite handedness (Fig. 6b). Interestingly, we defined right-handed blue CPL and green CPP as the target function, and the sample could be successfully fabricated with right-handed blue CPL and green CPP performance based on the inverse design model. Similarly, the target function with same handed blue CPL and CPP or opposite handed blue CPL and red CPP performance could be realized successfully (Supplementary Figs. 23 and 24), experimentally demonstrating the excellent accuracy of our inverse design model. All above results indicated the successful customization of programmable CPL and CPP performance at multiple predesigned wavelengths.
Fig. 6: Inverse design and 4D information encryption.
a Schematic diagram for the inverse design of CPP films with programmable CPL and CPP performance and b their fluorescence and phosphorescence images viewed with naked eyes, through L- or R-CPF, respectively. c Encoding terminology of 4D information encryption and decryption.
Further, we attempted to combine their programmable CPL and CPP characteristics to achieve 4D information encryption. As shown in Fig. 6c, the color of the phosphorescence emission is used as the first dimensional code (1stD code). The blue phosphorescent emission represents ‘0’, while others represent ‘1’. The handedness of the CPP could be used as the second dimensional code. The left-handed CPP represents ‘0’, while right-handed CPP represents ‘1’. As mentioned above, different color switching in phosphorescent emission could be observed through L- or R-CPF, respectively. By varying only the stretching degree of the PVA layer, different color switching range in phosphorescent emission could be achieved even from the same phosphorescent layer. Similarly, programmable CPL and CPP properties with same or opposite handedness could also be achieved from the same phosphorescent layer. Consequently, the effective color switching in phosphorescence emission and programmable CPL and CPP properties could be used as two other dimensions of encryption (Fig. 6c). Such a 4D bar code is much more complex than the other traditional system using only color, lifetime and chirality as the dimensions of encryption. Taking advantage of the above multi-mode characteristics of our 4D information encryption system, the information capacity and security would be greatly enhanced, holding significant potential applications for safeguarding valuable and authentic information.
Discussion
In summary, we demonstrated a workflow for the on-demand customization of functional materials with programmable CPL and CPP performance based on LLMs and transfer learning methods. The relationship between the structure of functional materials and their CPL and CPP performance has been built, allowing us to not only facilitate accurate forward prediction of CPP performance from experimental structure/process parameters, but also provide valuable guidance for personalized customization of CPP materials with target functionality. We demonstrate the significant advantages of transfer learning with limited chemical data, and facilitate the precise fabrication of CPP films with high glum up to 1.86, narrow full-width at half-maximum (FWHM, 49 nm) and customize CPP performance with targeted spectral position across the full visible spectrum. As a result, the on-demand customized CPP materials were exploited to fabricate programmable chiroptical components towards multiple practical applications including multicolor display and 4D multi-mode information encryption.
It is anticipated that these findings will not only shed light on the inverse design of chiroptical materials and devices with target functionalities, but also is of great fundamental value for discovering and optimizing functional materials for cutting-edge applications based on ML-driven method.
Methods
Materials and instruments
All chemicals were used without further purification. PVA (Mw:445000 g/mol), Direct Blue 71, Direct Yellow, and Congo Red dyes were purchased from Aladdin. The PVA films (30 μm, 48 μm, 80 μm) were purchased from New Blue Sky Materials Industry Co., Ltd. 7H-dibenzo[c,g]-carbazole (DBCz), 11,12-dihydroindolo[2,3-a]carbazole (ICz) and 1-pyrenylboronic acid (PA) were purchased from Aladdin, respectively. The DBCz/PVA, ICz/PVA and PA /PVA hybrid phosphorescent films were separately prepared upon drop casting method47,48,49. Chiral films with twisted-stacking structure were constructed by a modified method in analogy to the previous procedure29. The top transparent highly stretched PVA layer acts as a phase retarder, while the bottom stretched dyed PVA layer (with Direct Blue 71, Congo Red and Direct Yellow, respectively) serves as a polarizer. The stretching degree γ of PVA layer was characterized as the ratio between the increased length and the original length upon stretching. The bottom dyed PVA layer was rotated in either a clockwise or counter-clockwise fashion with respect to the top layer to generate chiroptical performance. Then the combination of above phosphorescent film and chiral films could generate both CPL and CPP performance.
Characterization
UV-vis absorption spectra were measured by a Shimadzu UV-2700. The fluorescence spectrum was obtained by a fluorescence spectrometer (SHIMADZU RF-6000), while phosphorescence luminescence spectrum was obtained by other fluorescence spectrometer (Hitachi F-4700). The asymmetry factor of luminescence was calculated based on glum = 2×(IL-IR)/(IL + IR) (the detail was described in Supplementary Fig. 25). In order to verify the correctness of the method, circular polarized photoluminescence was directly measured using a JASCO CPL-300 spectrometer. CD characterization was performed by a commercial CD spectrometer (JASCO-1500). Photographs were captured using a Redmi K50 smartphone under UV lamp illumination at room temperature.
Dataset
The dataset used for pre-training contains 1,493 samples, all of which were collected by our group in previous work29. Each sample includes four structural/process parameters, the selective absorption of dye molecules, and the corresponding CD spectra of the material.
The Fine-tuning dataset contains two sets of samples, each consisting of 135 samples. The input information for each sample includes the phosphorescent and fluorescence luminescence spectra of phosphorescent molecules (corresponding to the two groups of samples), the selective absorption of dye molecules, and four structural parameters. Specifically, three luminescent molecules, three dye molecules (Direct Blue 71, Direct Yellow, Congo Red), three film thicknesses (30 μm, 48 μm, 80 μm), five stretching degrees (80%, 90%, 100%, 110%, 120%), one grayscale, and one twist angle were selected. Each set of samples was fully sampled for all possible combinations. Correspondingly, the output of each sample is the glum spectrum of the CPP material fabricated under the given input parameter combination.
Model architecture
Forward prediction model includes two key components: pre-training task and parameter-encoder. In the pre-training task, the encoder employs a 5-layer MLP, while the decoder utilizes a 1-layer MLP.
The parameter encoder selectively encodes input features, to generate the offset factor (β) and the weight factor (α). These factors are implemented using MLPs and trainable embeddings. In particular, the offset-related features, which include thickness, dye species, and stretching properties, are first processed through a 3-layer MLP. The output is then modulated via element-wise multiplication with a learnable parameterized embedding to obtain the offset factor. Similarly, the weight-related feature, which is thickness, is processed through a dedicated 2-layer MLP to generate the weight factor. This factor is further constrained within the range [−1.5,1.5] to prevent excessive scaling effects. Since thickness and dye species are discrete variables, one-hot encoding is applied before processing.
All models were constructed and trained using the PyTorch framework. The forward prediction model consists of 5.56 M parameters, while the pre-trained autoencoder contains 0.41 M parameters. To expand the training dataset, Gaussian noise with a standard deviation of 1 × 10−7 was added to absorption properties of the dye before training, so that the dataset was extended tenfold.
Ablation study
The ablation study focuses on two key modules: transfer learning and the parameter encoder. Firstly, to verify the necessity and effectiveness of transfer learning, we examined the model’s performance without the pre-training task. Specifically, the spectral embedding descriptors were concatenated with the emission spectra of phosphorescent molecules and directly fed into the MLP, bypassing the encoder-decoder structure. Additionally, in order to analyze the effectiveness and influence of the two factors from the parameter encoder, three model variants were designed: (1) excluding the weight factor α, (2) excluding the offset factor β, and (3) excluding both. These three cases are where α equals [1], β equals [0], and both α equals [1] and β equals [0] together.
Hyperparameters setting
The hyperparameter batch_size is set to 128, the initial learning rate is 5×10−5, and the attenuation rate is 0.7. The optimization algorithm used in the model is Adam, with training over 300 epochs. All training hyperparameters and model configuration details are provided in Supplementary Table 6. In addition, in order to make the most of the dataset and evaluate the overall performance of the model, 10-fold cross-validation was employed. As a result, all reported predictions are based on the entire dataset, without relying on single data split. Finally, five independent repeated experiments were set up and showed similar results.
Hardware setup and software environments
The experiments were conducted on a computing system equipped with an NVIDIA RTX 2080 Ti GPU (11GB VRAM) and a 12 vCPU Intel Xeon Platinum 8255 C CPU @ 2.50 GHz, supported by 40GB system memory. The software stack included CUDA 11.8 for GPU acceleration, Python 3.8 as the primary programming language, and Ubuntu 20.04 LTS as the operating system. The deep learning framework PyTorch 2.0.0 was utilized for model implementation and training, ensuring compatibility with the specified hardware configuration. A complete forward prediction cycle (training and validation) required approximately 9.5 min on this setup. All experiments were performed under this environment to maintain consistency in computational performance and reproducibility.
Journal
Jul. 13, 2025