Open Access
27 August 2024 Machine learning for perovskite optoelectronics: a review
Feiyue Lu, Yanyan Liang, Nana Wang, Lin Zhu, Jianpu Wang
Author Affiliations +
Abstract

Metal halide perovskite materials have rapidly advanced in the perovskite solar cells and light-emitting diodes due to their superior optoelectronic properties. The structure of perovskite optoelectronic devices includes the perovskite active layer, electron transport layer, and hole transport layer. This indicates that the optimization process unfolds as a complex interplay between intricate chemical crystallization processes and sophisticated physical mechanisms. Traditional research in perovskite optoelectronics has mainly depended on trial-and-error experimentation, a less efficient approach. Recently, the emergence of machine learning (ML) has drastically streamlined the optimization process. Due to its powerful data processing capabilities, ML has significant advantages in uncovering potential patterns and making predictions. More importantly, ML can reveal underlying patterns in data and elucidate complex device mechanisms, playing a pivotal role in enhancing device performance. We present the latest advancements in applying ML to perovskite optoelectronic devices, covering perovskite active layers, transport layers, interface engineering, and mechanisms. In addition, it offers a prospective outlook on future developments. We believe that the deep integration of ML will significantly expedite the comprehensive enhancement of perovskite optoelectronic device performance.

1.

Introduction

Metal halide perovskite materials are considered one of the most promising materials for the next generation of optoelectronic devices, attributed to their high light absorption coefficient, large carrier mobility, and simplicity in synthesis methods.15 These characteristics have significantly increased the focus on perovskite materials within the scientific community, marking them a subject of growing research and development interest. Over the past few years, perovskite optoelectronic devices have witnessed remarkable advancements and widespread application across various fields.610 Notably, perovskite solar cells (PSCs) have attained power conversion efficiencies exceeding 25%,9,1114 and perovskite light-emitting diodes (PeLEDs) have broken through the 20% external quantum efficiency (EQE) barrier.1526 Furthermore, significant progress has been made in the development of perovskite detectors and lasers.2732 However, traditional approaches to material design and device fabrication often rely on trial-and-error processes. These methods are time-consuming and inefficient, hindering further performance enhancements in these devices.

Machine learning (ML) is a subfield of artificial intelligence that utilizes computer science, statistics, and mathematics to develop models and algorithms capable of automatically learning from data and making predictions based on acquired knowledge. Its data analysis capabilities, automatic decision-making processes, and adaptive model adaptation make it a powerful tool for optimizing research processes and minimizing experimental waste.33 In the perovskite materials, researchers are increasingly leveraging ML techniques to streamline the research process, notably by reducing the consumption of experimental materials and saving time.

Metal halide perovskites can generally be classified into three-dimensional (3D) and low-dimensional structures. 3D perovskites are described by the general chemical formula ABX3, where A signifies organic or inorganic cations, such as cesium (Cs+), formamidinium (FA+), and methylammonium (MA+), B denotes divalent metal cations, such as lead (Pb2+) and tin (Sn2+), and X represents halide ions including iodide (I), bromide (Br), and chloride (Cl). When A is a large organic cation that exceeds the size restrictions imposed by the tolerance factor for 3D perovskites, such as phenylethylammonium (PEA+) or 1-naphthylmethylammonium (NMA+), the perovskite structure changes, resulting in the formation of a low-dimensional perovskite and subsequently altering its optoelectronic properties.34,35 ML has been utilized to explore perovskite materials and predict their bandgap and optical properties, offering crucial technical support for the development of perovskite materials.36 In addition, ML has played a pivotal role in investigating the stability of lead halide perovskites. This has enhanced our understanding of the various factors influencing the stability of perovskite materials, thereby facilitating the development of stable devices.37 Furthermore, ML has been employed to optimize the crystallization process of perovskites. By fine-tuning synthesis parameters, it enables faster and more controlled crystallization, which is vital for the fabrication of high-quality perovskite crystals.38

Over the last 2 years, ML has significantly influenced the development of perovskite optoelectronics, showcasing a trend toward multifaceted collaborations.3944 ML is being strategically leveraged to tackle the practical challenges within perovskite optoelectronic devices, including the optimization of perovskite active layers, the selection of transport layers, and the elucidation of underlying functional mechanisms. Interpretable ML methodologies, such as SHapley Additive exPlanations (SHAP) values, are increasingly being applied to analyze and comprehend the factors affecting the performance of perovskite devices. The reliability of ML insights is often validated by harmonizing them with empirical device data and corroborative density functional theory (DFT) calculations. This review highlights the critical role of ML in boosting the efficiency of perovskite optoelectronic devices, encompassing developments, current challenges, and future prospects. It succinctly outlines the essential contribution of ML to the progression of perovskite technologies, with a special focus on solar cells and LEDs. The aim is to encourage interdisciplinary collaboration between perovskite optoelectronics and artificial intelligence, highlighting the significant potential for revolutionary advancements in optoelectronic technologies through this synergistic partnership.

2.

Fundamentals of Machine Learning

2.1.

Machine Learning Workflows

ML workflows usually include data collection, feature engineering, model selection, and performance evaluation (Fig. 1).42,45,46 Data collection is the foundation of ML, where the key lies in selecting appropriate data sources and ensuring the accuracy of the data. In the perovskite optoelectronic devices, data typically originate from laboratory measurements, as well as results from computational simulations. In addition, it is necessary to label data samples according to the type of algorithm selected for the task.

Fig. 1

Flowchart of ML-assisted preparation of perovskite optoelectronic devices. This includes the fabrication of devices, data acquisition, feature engineering, selection of appropriate models, and performance evaluation. The models encompass a range of learning algorithms: supervised, unsupervised, and semi-supervised.

AP_6_5_054001_f001.png

Following data collection, feature engineering is undertaken with the objective of converting raw material data into a format that ML algorithms can interpret and utilize effectively. This process is crucial for uncovering underlying correlations and physicochemical properties of the materials in question. Feature engineering encompasses several key parts, including feature extraction, feature selection, and feature construction. Feature extraction is the process of distilling valuable features from the raw data. Feature selection focuses on identifying the most relevant or representative features, which helps in reducing the dimensionality of the data and enhancing the model’s generalization capability. The Pearson correlation coefficient and variance threshold are commonly used methods. The Pearson correlation coefficient measures the linear relationship between pairs of features, filtering out highly correlated ones.47 The variance threshold method sets a threshold value, retaining only features with variance exceeding this threshold.48 Lastly, feature construction is about generating novel features through the combination, transformation, or derivation of existing ones, thereby bolstering the model’s ability to capture and express complex patterns and relationships.

Subsequently, the selection of ML models is contingent upon factors, such as the complexity of the task at hand and the specific characteristics of the dataset. Tasks are generally classified as either classification or regression types, with data often encompassing a range of formats including numerical values, text, and images. The most suitable ML model is selected tailored to the particular requirements of the task and the data’s attributes. Popular models frequently employed in such scenarios include random forests (RFs),49 support vector machines (SVMs),50 and diverse variants of neural networks.51

Performance evaluation is the process of assessing an ML model’s predictive capability and accuracy. By evaluating the performance of different models on a given dataset, the most suitable model can be selected for subsequent training and prediction. The evaluation methods for models vary depending on the task. Common evaluation metrics in classification tasks include precision and accuracy, while in regression tasks, metrics, such as mean squared error (MSE) and mean absolute error (MAE), are frequently used. MSE is the average of the squared differences between predicted and actual values. A smaller MSE indicates lower error and thus more accurate predictions. MAE is the average of the absolute differences between predicted and actual values. Similarly, a smaller MAE signifies lower error, reflecting more accurate predictions.

With the advancement of ML applications, there is a heightened emphasis on deciphering the reasoning behind model decisions. This trend is highlighted by the increasing adoption of SHAP values, a sophisticated method for interpreting ML model predictions.52 At the heart of SHAP is the concept of each feature’s marginal contribution to the model’s prediction, where “marginal contribution” signifies the impact of an individual feature on the predictive result of the model. SHAP endows the capability for both overarching (global) and specific (local) explanations of model behaviors. Global explanations shed light on the significance of each feature across the entire model, whereas local explanations delve into the rationale behind the model’s specific predictions for individual instances. In practical scenarios, the explanatory insights of SHAP are graphically represented, predominantly through SHAP plots. These plots intuitively exhibit the positive or negative influence of each feature on a particular instance. By analyzing SHAP plots, one can gain insights into which features are pivotal in driving specific predictive outcomes.

2.2.

Machine Learning Algorithms

ML algorithms cover a spectrum of approaches including supervised, unsupervised, and semi-supervised learning techniques (Fig. 1). Supervised learning is an ML method that involves training a model using labeled data sample.5355 The objective of this approach is for the model to learn the mapping between input features and output labels, enabling it to predict or classify new input features. In classification tasks, the model’s output is categorical, effectively segregating the data into defined groups.5659 Conversely, in regression tasks, the output is continuous, yielding a numerical prediction for the input data.60 Interestingly, classification and regression can be amalgamated to tackle certain types of problems.61 Among the common algorithms employed in supervised learning is the decision tree algorithm. This algorithm creates a tree-like model by systematically dividing the data based on its features, with each terminal node (or leaf) representing a specific category or value.62 Another notable algorithm is the SVM, which strives to maximize the margin between different categories. It achieves this by locating an optimal hyperplane within the feature space. Lastly, gradient boosting stands out as an ensemble learning method. It enhances the performance of a model by iteratively developing a “weak” classifier and continuously refining its predictive accuracy. This refinement process involves minimizing the gradient of the loss function, thereby progressively building a more robust classifier.63

Unlike supervised learning, unsupervised learning is an ML method that focuses on uncovering patterns, structures, or relationships in unlabeled data.6467 This method operates without predefined labels or output values, focusing solely on the input data. Unsupervised learning in the field of material science mainly performs tasks, such as data clustering, dimensionality reduction, or using natural language processing to retrieve materials from literature by discovering the internal structure and pattern of data.68,69 Prominent algorithms in unsupervised learning include Gaussian mixture models (GMMs).70 These models are used for data clustering and modeling, based on the premise that the data consist of multiple Gaussian distributions. GMMs work by estimating parameters to find the most accurate fit to these distributions. Another notable algorithm is self-organizing maps (SOMs), a type of neural network algorithm.71 SOMs organize similar input samples into adjacent neurons in a self-regulating manner, proving useful for data visualization and clustering. This approach is particularly effective for mapping high-dimensional data into a more manageable, low-dimensional space.

Semi-supervised learning is an ML method that combines supervised learning and unsupervised learning. It leverages a combination of labeled and unlabeled data during the training process.7275 By utilizing unlabeled data alongside labeled data, semi-supervised learning aims to enhance the generalization ability and performance of the model.75 A notable example of semi-supervised learning is the self-training method.76 This technique begins by training an initial model using labeled data. Once trained, this model is then applied to make predictions on unlabeled data. Samples for which the model has high prediction confidence are added to the labeled dataset. The model is then retrained with this augmented data, and this iterative process continues until the model reaches a point of convergence. Another approach within semi-supervised learning is co-training.77 This method involves training multiple classifiers simultaneously, with each classifier based on the assumption that different perspectives or feature sets exist within the data, and each provides independent and valuable information. In co-training, each classifier is trained using features from one specific view. The classifiers then use the predictions from one view to aid in the filtering and labeling of the unlabeled data from another view.

3.

Machine Learning in Perovskite Optoelectronic Devices

Fabrication of high-efficiency perovskite optoelectronic devices relies on various factors, including the composition and structure of the perovskite materials, device architecture, and fabrication methodologies. Conventional empirical (trial-and-error) methods have demonstrated limited efficacy in optimizing these dimensions. In recent years, ML has emerged as a powerful tool for leveraging existing data to uncover intrinsic patterns, thereby facilitating the realization of high-performance perovskite optoelectronics. In this context, we primarily focus on the application-centric research of ML within perovskite photovoltaics and PeLEDs.

3.1.

Perovskite Solar Cells

PSCs represent a promising photovoltaic technology due to their superior optical absorption properties and charge transport capabilities. These PSCs feature distinct layers, including a light-absorbing layer, electron transport layer (ETL), hole transport layer (HTL), anode, and cathode. The light-absorbing perovskite material plays a pivotal role in efficiently converting light energy into electrical energy. To enhance the stability and power conversion efficiency (PCE) of PSCs, ML has already been applied to various aspects, such as the device structure, perovskite layer, transport layer, and interface engineering. This innovative application offers critical insights, expedites the optimization processes, and broadens the comprehension of PSC mechanisms. It holds substantial potential for further enhancing both the stability and efficiency of PSCs, marking a significant advancement in renewable energy technologies.

3.1.1.

Optimizing surface capping layer

The degradation of perovskite films directly impacts the lifetime of perovskite photovoltaic devices. As a result, surface treatment of perovskites to form a low-dimensional perovskite capping layer has emerged as an effective strategy to enhance the stability of these photovoltaic devices. Nevertheless, a significant challenge persists in identifying appropriate materials for surface modification. Recently, ML has garnered significant attention in the development of passivation materials for perovskite photovoltaics.7881 The primary emphasis has been on the prediction and design of materials for passivation. This involves compiling a database of passivation materials from literature and then employing ML for predictions, with the use of SHAP to interpret the importance of features. In addition, there is a significant trend in comparing the results of ML predictions with actual experimental outcomes in devices, as well as with insights from DFT calculations. This comparison serves to further substantiate the reliability of ML applications.

Hartono et al. developed an ML method to select overlay materials to suppress perovskite degradation.78 They collected 21 organic salts from the PubChem database as overlay materials, using the structure and chemical characteristics of organic molecules as well as device fabrication conditions as model inputs, while using the initial degradation values and degradation rates of perovskites as model outputs [Fig. 2(a)]. Regression models [Fig. 2(b)] and SHAP value analysis [Fig. 2(c)] were employed to analyze the relationship between features and stability. It was found that a smaller number of hydrogen bond donors and a smaller topological polar surface area in organic molecules are associated with increased stability of the MAPbI3 films. The best-performing organic halide, phenethyltriethylammonium iodide (PTEAI), successfully extended the stability lifetime of MAPbI3 by 4±2 times compared to control MAPbI3. Moreover, characterization techniques (X-ray photoelectron spectroscopy and Fourier-transform infrared spectroscopy) have also revealed that PTEAI effectively forms a capping layer by modifying the surface structure and chemistry. This alteration correlates with the suppression of methylammonium loss and the formation of both PbI2 and oxygen-containing compounds at the surface of the perovskite, ultimately achieving the goal of protecting the perovskite from degradation.82

Fig. 2

(a) Schematic illustration of design rules for capping layer of PSCs. (b) The cross-validated root mean square error of various machine-learning models. (c) The feature importance ranking obtained from the RF regression algorithm and SHAP library, showing the chemical properties and processing conditions in descending order of importance (rank). The yellow and purple color indicates high and low values of a given feature, respectively. Reproduced with permission from Ref. 78 (CC-BY).

AP_6_5_054001_f002.png

Recently, Zhi et al. collected a dataset of 46 experimental data points documenting the improvement in power conversion efficiency after interface passivation with 19 different ammonium salts at various concentrations.79 These ammonium salts encompassed 14 from the initial dataset and five additional data selected via Latin hypercube sampling (LHS). LHS achieves diversity and uniformity sampling by dividing the range of values for each parameter into equally spaced subintervals and ensuring that there is only one sample point within each subinterval. The model inputs were 12 molecular descriptor features and the concentration of the ammonium salt precursor solution. The model output was the rate of improvement in device PCE. It is noteworthy that, to enhance precision, they employed an ensemble learning method that integrates five distinct ML regression models with various architectures to train and make predictions on the dataset. Similar to the previous work, SHAP analysis was also utilized to investigate the impact of molecular features on device efficiency, identifying hydrogen bond donors, hydrogen atoms, and the molecular lipid−water partition coefficient (molecular LogP or MolLogP) as the three most critical molecular features for the passivating ammonium salts. Ultimately, they employed the trained ML model to screen 10 suitable ammonium salts from 112 candidates in the PubChem database and experimentally validated six different ammonium salts. Among them, 2-phenylprop-1-ammonium iodide achieved PCEs of 22.36% and 24.47% for FAMACs and FAMA-based PSCs, respectively.

3.1.2.

Degradation mechanism

Despite numerous strategies having been employed to enhance the stability and power conversion efficiency of perovskite photovoltaics, the factors affecting stability and PCE remain elusive. Hu et al. constructed an ML regression model to investigate the key factors influencing the efficiency and stability of PSCs (Fig. 3).83 They selected 102 sets of PSC device data from published literature. Through feature engineering, they retained key parameters: Eg (bandgap), size (the grain size of perovskite films, inversely representing grain boundary density), Rn (surface roughness at the interface between the perovskite film and the HTL), Trap (trap density), and TRPL (average fluorescence lifetime from time-resolved photoluminescence measurements) as model inputs. These were utilized with the aim of predicting two critical outcomes: power conversion efficiency and stability. Several ML regression models containing support vector regression (SVR) and ridge regression were employed to discern the relationships between model inputs and outputs. Among these, SVR demonstrated the best performance. Moreover, it was found that the bandgap had the greatest impact on efficiency, while surface roughness and grain size predominantly influenced stability. Finally, guided by predictive insights, various annealing temperatures were strategically employed to modify grain sizes, and three unique organic molecules were utilized to change surface roughness. This approach yielded PSCs with an impressive 23.4% efficiency and exceptional long-term stability, maintaining 97.6% of their initial efficiency after an extended period of 3288 h under ambient conditions. It is noteworthy that the results from DFT calculations also show a remarkable correlation with the predictions of the ML model.

Fig. 3

Working principle of ML: relevance analysis—experimental verification—one with the best properties. Complex multivariable analysis by ML. PVK is perovskite. Reproduced with permission from Ref. 83 © 2022 Wiley‐VCH.

AP_6_5_054001_f003.png

3.1.3.

Optimizing electron transport layer

An ETL that matches the energy levels of the perovskite layer and exhibits high electron mobility is fundamental for ensuring efficient electron injection and extraction. In addition, it can impact the density of defect states at interfaces between the perovskite and the ETL, thereby preventing detrimental interfacial recombination. As a result, an appropriate ETL is essential for the performance of perovskite photovoltaic devices. This indicates the need not only to choose suitable ETL materials but also to perform surface modification or doping of the selected layer, to ensure efficient electron transport, and to facilitate perovskite crystal growth.

She et al. utilized a two-step ML approach, involving classification and regression, to study suitable ETLs (Fig. 4).80 Initially, they extracted 1820 performance data points of PSCs from 795 articles published between 2013 and 2020, forming the first dataset. The data were then categorized based on the materials of the ETLs, dividing them into groups, such as TiO2-based, SnO2-based, and other ETL-based groups. In this dataset, the ETLs were undoped. To prevent sample imbalance, the samples were further classified into three categories based on their efficiency: category A for efficiency below 9%, category B for efficiency ranging from 9% to 18%, and category C for efficiency above 18%. By selecting nine features that influence the power conversion efficiency through feature engineering, they used these as model inputs with power conversion efficiency as the output, training different ML classification models on this dataset. Ultimately, they further investigated feature importance based on the classification model, finding that TiO2 or SnO2 as ETL materials could lead to high-performance PSCs.

Fig. 4

Flow chart of the two-step ML in this work, which includes two steps, first-step ML (left side) and second-step ML (right side). Reproduced with permission from Ref. 80 © 2021 Royal Society of Chemistry.

AP_6_5_054001_f004.png

Subsequently, to further improve the PCE, they focused on the impact of doping ETLs. A second dataset was created, comprising 90 device data points for PSCs based on doped SnO2 and 96 for PSCs based on doped TiO2. The physical and chemical properties and concentrations of the doping elements were used as model input features. They calculated the efficiency improvement ratio (EIR), which is the PCE of PSCs with doped ETL divided by the PCE of corresponding PSCs with undoped ETL, and used EIR as the model output. Using the RF model and genetic algorithms (GAs) for data training, they predicted doped ETLs that might lead to high EIR. Finally, based on predictions, they achieved PCEs of up to 30.47% and 28.54% for Cs-doped TiO2 ETL in CsFAMA-based PSCs and S-doped SnO2 ETL in FAMA-based PSCs, respectively.

3.1.4.

Optimizing lead-free perovskite

In order to develop environmentally friendly lead-free PSCs, ML is utilized to optimize the device architecture and components of Sn-based PSCs. Bak et al. initially collected real experimental data from 122 different photovoltaic devices with the perovskite structure ASnI3 from 49 published journal articles. The data include information on perovskite chemical structure, metal electrodes, transparent electrodes, HTL, and ETL as input features, while the corresponding short-circuit current density (JSC), open-circuit voltage (VOC), and fill factor (FF) values are considered as output features. A deep neural network (DNN) algorithm was then developed, and data augmentation was performed using K-fold cross-validation to randomly automate hyperparameter optimization of the DNN model to obtain the model’s pre-trained weights. The performance of the model was assessed by comparing the experimental data’s PCE values with the DNN’s predicted values for each layer (i.e., perovskite layer, metal electrode, transparent electrode, HTL, and ETL). Finally, by random combination of a large number of input features, predictions were made through the optimized DNN model (Fig. 5). ML recommended a new Sn-based PSC device structure FTO/PEDOT:PSS/EDA0.01PEA0.07Cs0.03FA0.51MA0.38SnI3/indene-C60 bisadduct (ICBA)/2,9-dimethyl-4,7-diphenyl-1,10-phenanthrolin (BCP)/Ag. Obviously, the composition of the perovskite is still extremely complex. Furthermore, experimental verification has confirmed that the PCE of the ML-optimized Sn-based perovskite photovoltaics reached 5.57%, which is a significant improvement over the traditional testing results (PCE of 1.72%).84

Fig. 5

Schematics of Sn-PSC design ML recommendation. (a) Parameter setting of each feature for the recommendation process. (b) Schematic of recommended Sn PSCs. (c) Labeled number and corresponding materials used as other layers in (a). Reproduced with permission from Ref. 84 (CC-BY).

AP_6_5_054001_f005.png

3.1.5.

Vapor-deposited perovskite solar cells

Although the performance of solution-processed perovskite photovoltaics has rapidly improved in laboratory settings, scaling up to large-scale production remains a significant challenge. Vapor-deposited PSCs, which need to be compatible with the existing large-scale electronic industry, have substantially lagged behind. Wang et al. extracted information on 220 devices from 150 published papers on thermal evaporation PSCs between 2015 and 2022. Initially, the data were cleansed, reducing the sample size to 180 groups, and missing values in the dataset were filled with the mode of each column. The dataset was then expanded using Gaussian noise, tripling the sample size. Twenty features were chosen as input variables, encompassing aspects such as the material composition of the perovskite layer and device fabrication methods, with the device’s JSC, VOC, FF, and PCE as output variables. Training was conducted using 10 different ML models, with the RF model scoring highest on the test set. To enhance the generalization ability of the ML model, it was combined with a GA, increasing the prediction accuracy of the GA-enhanced RF model on the test set from 60.2% to 67.4%. Subsequently, the importance of features influencing PCE was analyzed through the mean Gini index of feature contributions in each decision tree of the RF model. The study found that the five features contributing most to the PCE were the proportions of MA, FA, Br, annealing temperature, and pressure (Fig. 6). Finally, the optimal device structure proposed was indium tin oxide (ITO)/SnO2/BCP/FA0.25MA0.25Cs0.5Pb(I0.75Br0.25)3/MeO-2PACz/Ag, achieving the highest PCE of 26.1% under conditions of 90°C to 105°C and an evaporation pressure of 3×105 to 4×105  mbar. This work not only identified the optimal conditions for vapor-deposited perovskite but also, more crucially, delineated the direction for vapor-deposited perovskite.85

Fig. 6

ML flowchart for studying the variables that influence the performance of vapor-deposited PSCs, consisting of four main parts: building the dataset, feature engineering, SHAP interpretation model, and model prediction. Reproduced with permission from Ref. 85 © 2023 Royal Society of Chemistry.

AP_6_5_054001_f006.png

3.1.6.

Machine vision for large-area slot-die coated films

In the commercialization of perovskite photovoltaics, the controllable fabrication of large-area homogeneous perovskite films represents a critical technological challenge. Traditionally, the assessment of uniformity in large-area films has relied primarily on visual inspection, which is imprecise. Taherimakhsousi et al. developed a rapid, reliable, and non-destructive machine vision approach to quantify the uniformity of large-area perovskite films, thereby aiding process optimization.86 Initially, white light photography was employed to sample slot-die coated perovskite films. Subsequently, a convolutional neural network based on the VGG16 architecture successfully segmented the original images into fully covered,87 partially covered, and uncovered regions without the need for manual intervention. Optimized for pixel resolution, this approach spatially quantified multiple attributes of perovskite films (such as substrate coverage, film thickness, and defect density) from 25  cm2 samples and established correlations between slot-die coating process parameters (wet film thickness and gas knife speed) and film properties (Fig. 7). This strategy facilitated multi-parameter, multi-objective optimization of the gas-knife assisted slot-die coating process, enabling the identification of optimal process conditions that simultaneously maximize coating throughput, film quality, and predicted device current density while reducing the characterization effort required for each process condition. This work demonstrates the significant potential of machine vision in optimizing large-area photovoltaic films.

Fig. 7

Machine vision workflow for quantifying large-area perovskite film morphology from optical images using the PerovskiteVision tool. Reproduced with permission from Ref. 86 (CC-BY).

AP_6_5_054001_f007.png

3.2.

Perovskite Light-Emitting Diodes

Perovskite LEDs are semiconductor devices that transform electrical energy into light, featuring an optically active layer constituted by perovskite materials. Their primary performance indicators include EQE, calculated as the ratio of photons emitted from the LED to the electrons circulating in the external circuit, and half-life (T50), representing the duration for brightness (or efficiency) to decrease by half.

3.2.1.

Predicting additive molecules

The quality of the perovskite active layer is crucial for the performance of perovskite LEDs. Currently, enhancing the quality of perovskite films involves additive engineering, which includes introducing organic molecules into the perovskite precursor solution. These molecules, while not integrating into the perovskite lattice, improve film quality by tuning the crystallization pathway. Yet, not every organic molecule is a suitable additive, and unsuitable choices can impair LED performance. Traditional additive selection based on trial-and-error is inefficient and challenged by the small size of perovskite LED additive databases for conventional ML. To address this, the enhanced molecular information model (EMIM) was developed, incorporating both qualitative molecular fingerprints and quantitative descriptors to enhance molecular information and improve predictive accuracy (Fig. 8).39 It is noteworthy that in this work, the additive molecular database used was validated through experimental lab device tests. By utilizing EQE as the output feature and additive molecular descriptors as inputs, along with a 10.3% EQE threshold for classification, the EMIM model attained an impressive 98% prediction accuracy, significantly outperforming traditional ML models. In addition, among the 12 chosen additives, only one showed inconsistent results between EMIM predictions and device fabrication outcomes, demonstrating EMIM’s reliability. One of the additives even elevated the EQE of perovskite LEDs to 22.7%, making it one of the most efficient near-infrared LEDs. This work is pioneering in using ML for analyzing perovskite LEDs, especially for small data additives.

Fig. 8

Traditional method versus ML-assisted method. The traditional method is to verify the effectiveness of the additive for PeLEDs, while the ML-assisted method is to fabricate the device with the “good” additive predicted by ML. The additive is added into the perovskite precursor solution to form the perovskite film. Reproduced with permission from Ref. 39 © 2022 Wiley-VCH.

AP_6_5_054001_f008.png

3.2.2.

Half-lifetime prediction

Stability is a key objective in the development of perovskite LEDs. Stability measurements, unlike EQE measurements, are time-consuming, necessitating hundreds of hours for the most stable perovskite LEDs. Therefore, developing strategies to predict the half-life of perovskite LEDs is crucial. Recently, we found that traditional exponential fitting methods, such as monoexponential, biexponential, and stretched exponential functions, are inadequate for predicting half-life due to complex degradation mechanisms involving ion migration, Joule heating, and interfacial effects.40 To address this, an ensemble learning model was developed to predict the T50 lifetime (time to 50% of initial performance) based on shorter tests T80 (time to 80% of initial performance) (Fig. 9). A database of 210 near-infrared and red perovskite LEDs was initially compiled from experimental data. To augment this dataset, a DNN combined with the fast gradient sign method was employed, effectively doubling the dataset to 420 samples. Comparative analysis of five different ML models, such as least absolute shrinkage and selection operator, SVR, gradient boosting regressor, Gaussian process regressor, and elastic net (EN), showed improved accuracy over the original data. An ensemble learning model (5EML) based on these five algorithms was then used for better fitting the degradation curves of perovskite LEDs, achieving a high score of 0.995. This model also applies to quantum dot LEDs, demonstrating its universality.

Fig. 9

Illustration of lifetime prediction by 5EML. Reproduced with permission from Ref. 40 © 2024 Wiley-VCH.

AP_6_5_054001_f009.png

4.

Conclusion and Perspectives

4.1.

Conclusion

It is evident that the application of ML in the perovskite field has undergone a significant transformation. Initially focused primarily on predicting new perovskite materials, the emphasis has now shifted toward perovskite optoelectronic devices, including device design, crystallization regulation, stability analysis, and manufacturing processes. More importantly, in the fields of perovskite photovoltaics and perovskite LEDs, the focus of ML research differs. Specifically, in perovskite photovoltaics, ML is primarily used for interface engineering of the perovskite layer and optimization of the transport layers, whereas in perovskite LEDs, it mainly focuses on additive screening and stability prediction. This difference essentially arises from the distinct optimization strategies for these two types of devices.88 Perovskite LEDs are extremely sensitive to defects, so the key is to suppress defects and enhance photoluminescence quantum efficiency. In contrast, perovskite photovoltaics are less sensitive to defects, with the primary focus being on charge carrier transport.

Certainly, the application of ML in perovskite optoelectronics has the following characteristics. First, the primary source of data for perovskite optoelectronic devices is derived from experimental results, ensuring their authenticity and dependability. Second, this research area strongly emphasizes the veracity of ML inferences. ML outcomes are typically corroborated through actual device fabrication or supplemented with DFT calculations. Third, the SHAP method is widely employed to investigate the attributes of passivation materials, offering critical insights into feature importance. This method serves as a theoretical compass for material design. Fourth, current algorithmic approaches are largely focused on supervised learning, including classification and regression models.

4.2.

Perspectives and Challenges

Despite the notable progress and significant research potential demonstrated by ML in perovskite optoelectronics, the overall volume of research remains limited, and there is a substantial need for further in-depth exploration. The development of perovskite optoelectronic devices is in a relatively nascent stage, and a considerable amount of experimental data is yet to be made publicly available, which poses challenges for the application of ML techniques. Moreover, the optimization process of perovskite optoelectronic devices is exceedingly complex and involves a multitude of details. Slight variances during the device fabrication process can lead to significantly divergent experimental outcomes, making it difficult to assess the reliability of data samples. Therefore, regarding the intersection of ML and perovskite optoelectronic devices, we propose the following outlooks and suggestions.

Increasing the open-sourcing of perovskite data. The open-sourcing of algorithms has become increasingly common, enabling researchers in the perovskite field to effectively utilize these models. However, the open-sourcing of perovskite data remains relatively limited. Therefore, it is recommended to establish a platform for perovskite data sharing. In addition, an abundance of data samples can help address concerns about the reliability of experimental data. It is important to note that not all collected experimental data are used for ML. Instead, data samples that have undergone preprocessing and feature engineering are utilized. In other words, the robust data processing capabilities of ML can mitigate minor biases introduced during experimental data collection.

To tackle the challenge of limited datasets, researchers are encouraged to delve into data augmentation methodologies that utilize the physical and chemical properties of perovskite molecules as integral feature descriptors in ML models. This strategy facilitates the extraction of enhanced molecular insights while preserving the intricate structural details. Recent breakthroughs in graph neural networks, specifically in molecular generation, have demonstrated significant potential in broadening the scope of datasets and elevating their quality.89

Strengthen the depth of interdisciplinary research between ML and perovskites. First, the interpretability of ML models is especially critical, as it reveals the decision criteria used by the models, thereby guiding the understanding and optimization strategies for perovskite optoelectronic devices. Second, enhance the application of machine image processing in analyzing perovskite film morphology. For LEDs, a discrete morphology is desirable as it helps improve light extraction efficiency. In contrast, the photovoltaic often prefers large and dense grains to enhance carrier transport. Currently, the study of perovskite film morphology primarily relies on empirical deductions, which fail to accurately analyze and extract underlying morphological patterns.

Code and Data Availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Acknowledgments

This work was financially supported by the National Key Research and Development Program of China (Grant No. 2022YFA1204800) and the National Natural Science Foundation of China (Grant Nos. 62288102, 52373220, 62375124, 62134007, and 52233011).

References

1. 

G. Xing et al., “Long-range balanced electron-and hole-transport lengths in organic-inorganic CH3NH3PbI3,” Science, 342 (6156), 344 –347 https://doi.org/10.1126/science.1243167 SCIEAS 0036-8075 (2013). Google Scholar

2. 

H. Tsai et al., “High-efficiency two-dimensional Ruddlesden–Popper perovskite solar cells,” Nature, 536 (7616), 312 –316 https://doi.org/10.1038/nature18306 (2016). Google Scholar

3. 

N. T. P. Hartono et al., “The effect of structural dimensionality on carrier mobility in lead-halide perovskites,” J. Mater. Chem. A, 7 (41), 23949 –23957 https://doi.org/10.1039/C9TA05241K (2019). Google Scholar

4. 

S. D. Stranks et al., “Electron-hole diffusion lengths exceeding 1 micrometer in an organometal trihalide perovskite absorber,” Science, 342 (6156), 341 –344 https://doi.org/10.1126/science.1243982 SCIEAS 0036-8075 (2013). Google Scholar

5. 

M. Liu, M. B. Johnston and H. J. Snaith, “Efficient planar heterojunction perovskite solar cells by vapour deposition,” Nature, 501 (7467), 395 –398 https://doi.org/10.1038/nature12509 (2013). Google Scholar

6. 

M. M. Lee et al., “Efficient hybrid solar cells based on meso-superstructured organometal halide perovskites,” Science, 338 (6107), 643 –647 https://doi.org/10.1126/science.1228604 SCIEAS 0036-8075 (2012). Google Scholar

7. 

H.-S. Kim et al., “Lead iodide perovskite sensitized all-solid-state submicron thin film mesoscopic solar cell with efficiency exceeding 9%,” Sci. Rep., 2 (1), 591 https://doi.org/10.1038/srep00591 SRCEC3 2045-2322 (2012). Google Scholar

8. 

Z.-K. Tan et al., “Bright light-emitting diodes based on organometal halide perovskite,” Nat. Nanotechnol., 9 (9), 687 –692 https://doi.org/10.1038/nnano.2014.149 NNAABX 1748-3387 (2014). Google Scholar

9. 

L. C. Schmidt et al., “Nontemplate synthesis of CH3NH3PbBr3 perovskite nanoparticles,” J. Am. Chem. Soc., 136 (3), 850 –853 https://doi.org/10.1021/ja4109209 JACSAT 0002-7863 (2014). Google Scholar

10. 

C. Motta et al., “Revealing the role of organic cations in hybrid halide perovskite CH3NH3PbI3,” Nat. Commun., 6 (1), 7026 https://doi.org/10.1038/ncomms8026 NCAOBW 2041-1723 (2015). Google Scholar

11. 

Z. Huang et al., “Anion–π interactions suppress phase impurities in FAPbI3 solar cells,” Nature, 623 (7987), 531 –537 https://doi.org/10.1038/s41586-023-06637-w (2023). Google Scholar

12. 

M. A. Green et al., “Solar cell efficiency tables (version 63),” Prog. Photovoltaics Res. Appl., 32 (1), 3 –13 https://doi.org/10.1002/pip.3750 (2024). Google Scholar

13. 

H. Chen et al., “Improved charge extraction in inverted perovskite solar cells with dual-site-binding ligands,” Science, 384 (6692), 189 –193 https://doi.org/10.1126/science.adm9474 SCIEAS 0036-8075 (2024). Google Scholar

14. 

J. Zhou et al., “Highly efficient and stable perovskite solar cells via a multifunctional hole transporting material,” Joule, 8 (6), 1691 –1706 https://doi.org/10.1016/j.joule.2024.02.019 (2024). Google Scholar

15. 

M. Li et al., “Acceleration of radiative recombination for efficient perovskite LEDs,” Nature, 630 (8017), 631 –635 https://doi.org/10.1038/s41586-024-07460-7 (2024). Google Scholar

16. 

Y. Cao et al., “Perovskite light-emitting diodes based on spontaneously formed submicrometre-scale structures,” Nature, 562 (7726), 249 –253 https://doi.org/10.1038/s41586-018-0576-2 (2018). Google Scholar

17. 

Y. He et al., “CsPbBr3 perovskite detectors with 1.4% energy resolution for high-energy γ-rays,” Nat. Photonics, 15 (1), 36 –42 https://doi.org/10.1038/s41566-020-00727-1 NPAHBY 1749-4885 (2021). Google Scholar

18. 

J. Jiang et al., “Red perovskite light-emitting diodes with efficiency exceeding 25% realized by co-spacer cations,” Adv. Mater., 34 (36), 2204460 https://doi.org/10.1002/adma.202204460 ADVMEW 0935-9648 (2022). Google Scholar

19. 

J. S. Kim et al., “Ultra-bright, efficient and stable perovskite light-emitting diodes,” Nature, 611 (7937), 688 –694 https://doi.org/10.1038/s41586-022-05304-w (2022). Google Scholar

20. 

Y. Jiang et al., “Synthesis-on-substrate of quantum dot solids,” Nature, 612 (7941), 679 –684 https://doi.org/10.1038/s41586-022-05486-3 (2022). Google Scholar

21. 

Y. Sun et al., “Bright and stable perovskite light-emitting diodes in the near-infrared range,” Nature, 615 (7954), 830 –835 https://doi.org/10.1038/s41586-023-05792-4 (2023). Google Scholar

22. 

W. Bai et al., “Perovskite light-emitting diodes with an external quantum efficiency exceeding 30%,” Adv. Mater., 35 (39), 2302283 https://doi.org/10.1002/adma.202302283 ADVMEW 0935-9648 (2023). Google Scholar

23. 

S. Yuan et al., “Efficient blue electroluminescence from reduced-dimensional perovskites,” Nat. Photonics, 18 (5), 425 –431 https://doi.org/10.1038/s41566-024-01382-6 NPAHBY 1749-4885 (2024). Google Scholar

24. 

L. Kong et al., “Fabrication of red-emitting perovskite LEDs by stabilizing their octahedral structure,” Nature, 631 73 –79 https://doi.org/10.1038/s41586-024-07531-9 (2024). Google Scholar

25. 

Y. Feng et al., “Nucleophilic reaction-enabled chloride modification on CsPbI3 quantum dots for pure red light-emitting diodes with efficiency exceeding 26%,” Angew. Chem. Int. Ed., 63 (11), e202318777 https://doi.org/10.1002/anie.202318777 (2024). Google Scholar

26. 

J. Jiang et al., “Efficient pure-red perovskite light-emitting diodes with strong passivation via ultrasmall-sized molecules,” Sci. Adv., 10 (18), eadn5683 https://doi.org/10.1126/sciadv.adn5683 STAMCV 1468-6996 (2024). Google Scholar

27. 

H. Zhu et al., “Lead halide perovskite nanowire lasers with low lasing thresholds and high quality factors,” Nat. Mater., 14 (6), 636 –642 https://doi.org/10.1038/nmat4271 NMAACR 1476-1122 (2015). Google Scholar

28. 

C. Bao et al., “Bidirectional optical signal transmission between two identical devices using perovskite diodes,” Nat. Electron., 3 (3), 156 –164 https://doi.org/10.1038/s41928-020-0382-3 NEREBX 0305-2257 (2020). Google Scholar

29. 

K. Sakhatskyi et al., “Stable perovskite single-crystal X-ray imaging detectors with single-photon sensitivity,” Nat. Photonics, 17 (6), 510 –517 https://doi.org/10.1038/s41566-023-01207-y NPAHBY 1749-4885 (2023). Google Scholar

30. 

K. Wen et al., “Continuous-wave lasing in perovskite LEDs with an integrated distributed feedback resonator,” Adv. Mater., 35 (44), 2303144 https://doi.org/10.1002/adma.202303144 ADVMEW 0935-9648 (2023). Google Scholar

31. 

L. Gu et al., “High Q-factor and low threshold continuous-wave near-infrared lasing with quasi-2D perovskites,” Adv. Funct. Mater., 33 (44), 2303900 https://doi.org/10.1002/adfm.202303900 AFMDC6 1616-301X (2023). Google Scholar

32. 

C. Qin et al., “Stable room-temperature continuous-wave lasing in quasi-2D perovskite films,” Nature, 585 (7823), 53 –57 https://doi.org/10.1038/s41586-020-2621-1 (2020). Google Scholar

33. 

Q. Xu et al., “Rationalizing perovskite data for machine learning and materials design,” J. Phys. Chem. Lett., 9 (24), 6948 –6954 https://doi.org/10.1021/acs.jpclett.8b03232 JPCLCD 1948-7185 (2018). Google Scholar

34. 

N. Wang et al., “Perovskite light-emitting diodes based on solution-processed self-organized multiple quantum wells,” Nat. Photonics, 10 (11), 699 –704 https://doi.org/10.1038/nphoton.2016.185 NPAHBY 1749-4885 (2016). Google Scholar

35. 

L. Zhu et al., “Large organic cations in quasi-2D perovskites for high-performance light-emitting diodes,” J. Phys. Chem. Lett., 11 (20), 8502 –8510 https://doi.org/10.1021/acs.jpclett.0c02476 JPCLCD 1948-7185 (2020). Google Scholar

36. 

V. Gladkikh et al., “Machine learning for predicting the band gaps of ABX3 perovskites from elemental properties,” J. Phys. Chem. C, 124 (16), 8905 –8918 https://doi.org/10.1021/acs.jpcc.9b11768 JPCCCK 1932-7447 (2020). Google Scholar

37. 

K. Higgins et al., “Chemical robotics enabled exploration of stability in multicomponent lead halide perovskites via machine learning,” ACS Energy Lett., 5 (11), 3426 –3436 https://doi.org/10.1021/acsenergylett.0c01749 (2020). Google Scholar

38. 

J. Kirman et al., “Machine-learning-accelerated perovskite crystallization,” Matter, 2 (4), 938 –947 https://doi.org/10.1016/j.matt.2020.02.012 (2020). Google Scholar

39. 

L. Zhang et al., “Deep learning for additive screening in perovskite light‐emitting diodes,” Angew. Chem. Int. Ed., 61 (37), e202209337 https://doi.org/10.1002/anie.202209337 (2022). Google Scholar

40. 

L. Zhang et al., “Prediction of operational lifetime of perovskite light emitting diodes by machine learning,” Adv. Intell. Syst., 6 (6), 2300772 https://doi.org/10.1002/aisy.202300772 (2024). Google Scholar

41. 

N. K. Bansal et al., “Machine learning in perovskite solar cells: recent developments and future perspectives,” Energy Tech., 11 (12), 2300735 https://doi.org/10.1002/ente.202300735 (2023). Google Scholar

42. 

Y. Liu et al., “Machine learning for perovskite solar cells and component materials: key technologies and prospects,” Adv. Funct. Mater., 33 (17), 2214271 https://doi.org/10.1002/adfm.202214271 AFMDC6 1616-301X (2023). Google Scholar

43. 

T. Liu et al., “Machine-learning accelerating the development of perovskite photovoltaics,” Sol. RRL, 7 (23), 2300650 https://doi.org/10.1002/solr.202300650 (2023). Google Scholar

44. 

T. J. Jacobsson et al., “An open-access database and analysis tool for perovskite solar cells based on the FAIR data principles,” Nat. Energy, 7 (1), 107 –115 https://doi.org/10.1038/s41560-021-00941-3 (2022). Google Scholar

45. 

Q. Tao et al., “Machine learning for perovskite materials design and discovery,” NPJ Comput. Mater., 7 (1), 23 https://doi.org/10.1038/s41524-021-00495-8 (2021). Google Scholar

46. 

M. Srivastava et al., “Machine learning roadmap for perovskite photovoltaics,” J. Phys. Chem. Lett., 12 (32), 7866 –7877 https://doi.org/10.1021/acs.jpclett.1c01961 JPCLCD 1948-7185 (2021). Google Scholar

47. 

J. Benesty et al., “Pearson correlation coefficient,” Noise Reduction in Speech Processing, 2 1 –4 Springer Berlin Heidelberg, Berlin, Heidelberg (2009). Google Scholar

48. 

N. Otsu, “A threshold selection method from gray-level histograms,” IEEE Trans. Syst. Man Cybern., 9 (1), 62 –66 https://doi.org/10.1109/TSMC.1979.4310076 (1979). Google Scholar

49. 

L. Breiman, “Random forests,” Mach. Learn., 45 (1), 5 –32 https://doi.org/10.1023/A:1010933404324 MALEEZ 0885-6125 (2001). Google Scholar

50. 

C. Cortes and V. Vapnik, “Support-vector networks,” Mach. Learn., 20 (3), 273 –297 https://doi.org/10.1007/BF00994018 MALEEZ 0885-6125 (1995). Google Scholar

51. 

S. Albawi, T. A. Mohammed and S. Al-Zawi, “Understanding of a convolutional neural network,” in Int. Conf. Eng. Technol., 1 –6 (2017). https://doi.org/10.1109/ICEngTechnol.2017.8308186 Google Scholar

52. 

S. M. Lundberg and S.-I. Lee, “A unified approach to interpreting model predictions,” (2017). Google Scholar

53. 

R. Caruana and A. Niculescu-Mizil, “An empirical comparison of supervised learning algorithms,” in Int. Conf. Mach. Learn., 161 –168 (2006). https://doi.org/10.1145/1143844.1143865 Google Scholar

54. 

P. C. Sen, M. Hajra, M. Ghosh, “Supervised classification algorithms in machine learning: a survey and review,” Emerging Technology in Modelling and Graphics, 99 –111 Springer, Singapore (2020). Google Scholar

55. 

P. Cunningham, M. Cord, S. J. Delany, “Supervised learning,” Machine Learning Techniques for Multimedia: Case Studies on Organization and Retrieval, 21 –49 Springer, Berlin, Heidelberg (2008). Google Scholar

56. 

T. Hastie, J. Friedman and R. Tibshirani, The Elements of Statistical Learning, Springer New York, New York (2001). Google Scholar

57. 

S. B. Kotsiantis, I. D. Zaharakis and P. E. Pintelas, “Machine learning: a review of classification and combining techniques,” Artif. Intell. Rev., 26 (3), 159 –190 https://doi.org/10.1007/s10462-007-9052-3 AIREV6 (2006). Google Scholar

58. 

I. G. Maglogiannis, Emerging Artificial Intelligence Applications in Computer Engineering: Real Word AI Systems with Applications in EHealth, HCI, Information Retrieval and Pervasive Technologies, IOS Press, Amsterdam (2007). Google Scholar

59. 

A. E. Maxwell, T. A. Warner and F. Fang, “Implementation of machine-learning classification in remote sensing: an applied review,” Int. J. Remote Sens., 39 (9), 2784 –2817 https://doi.org/10.1080/01431161.2018.1433343 IJSEDK 0143-1161 (2018). Google Scholar

60. 

K. P. Murphy, Machine Learning: A Probabilistic Perspective, MIT Press( (2012). Google Scholar

61. 

V. Stanev et al., “Machine learning modeling of superconducting critical temperature,” NPJ Comput. Mater., 4 (1), 29 https://doi.org/10.1038/s41524-018-0085-8 (2018). Google Scholar

62. 

A. J. Myles et al., “An introduction to decision tree modeling,” J. Chemom., 18 (6), 275 –285 https://doi.org/10.1002/cem.873 JOCHEU 0886-9383 (2004). Google Scholar

63. 

J. H. Friedman, “Stochastic gradient boosting,” Comput. Stat. Data Anal., 38 (4), 367 –378 https://doi.org/10.1016/S0167-9473(01)00065-2 CSDADW 0167-9473 (2002). Google Scholar

64. 

T. Hastie, R. Tibshirani, J. Friedman, “Unsupervised learning,” The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 485 –585 Springer, New York (2009). Google Scholar

65. 

G. Hinton and T. J. Sejnowski, Unsupervised Learning: Foundations of Neural Computation, MIT Press, Cambridge, Masschusetts (1999). Google Scholar

66. 

Z. Ghahramani, Advanced Lectures on Machine Learning: ML Summer Schools 2003, Canberra, Australia, February 2-14, 2003, Tübingen, Germany, August 4-16, 2003, Revised Lectures, Springer, Berlin, Heidelberg (2004). Google Scholar

67. 

D. Greene, P. Cunningham, R. Mayer, “Unsupervised learning and clustering,” Machine Learning Techniques for Multimedia: Case Studies on Organization and Retrieval, 51 –90 Springer, Berlin, Heidelberg (2008). Google Scholar

68. 

K. R. Chowdhary, “Natural language processing,” Fundamentals of Artificial Intelligence, 603 –649 Springer India, New Delhi (2020). Google Scholar

69. 

A. Gardin et al., “Classifying soft self-assembled materials via unsupervised machine learning of defects,” Commun. Chem., 5 (1), 82 https://doi.org/10.1038/s42004-022-00699-z CFBEEC (2022). Google Scholar

70. 

D. A. Reynolds, “Gaussian mixture models,” Encyclopedia Biometr., 741 659 –663 https://doi.org/10.1007/978-0-387-73003-5_196 (2009). Google Scholar

71. 

T. Kohonen, “The self-organizing map,” Proc. IEEE, 78 (9), 1464 –1480 https://doi.org/10.1109/5.58325 IEEPAD 0018-9219 (1990). Google Scholar

72. 

X. Zhu and A. B. Goldberg, Introduction to Semi-Supervised Learning, Springer Nature( (2022). Google Scholar

73. 

X. Zhai et al., “S4L: self-supervised semi-supervised learning,” in Proc. IEEE/CVF Int. Conf. Comput. Vis., 1476 –1485 (2019). https://doi.org/10.1109/ICCV.2019.00156 Google Scholar

74. 

“Semi-supervised learning (Chapelle, O. et al., Ed.; 2006) [Book reviews],” IEEE Trans. Neural Netw., 20 (3), 542 https://doi.org/10.1109/TNN.2009.2015974 ITNNEP 1045-9227 (2009). Google Scholar

75. 

X. Zhu, Semi-Supervised Learning Literature Survey, University of Wisconsin-Madison, Department of Computer Sciences, CS Technical Report TR1530 ( (2005). Google Scholar

76. 

M.-R. Amini et al., “Self-training: a survey,” (2023). Google Scholar

77. 

M.-F. Balcan, A. Blum and K. Yang, “Co-training and expansion: towards bridging theory and practice,” in Adv. Neural Inf. Process. Syst., 89 –96 (2004). Google Scholar

78. 

N. T. P. Hartono et al., “How machine learning can help select capping layers to suppress perovskite degradation,” Nat. Commun., 11 (1), 4172 https://doi.org/10.1038/s41467-020-17945-4 NCAOBW 2041-1723 (2020). Google Scholar

79. 

C. Zhi et al., “Machine-learning-assisted screening of interface passivation materials for perovskite solar cells,” ACS Energy Lett., 8 (3), 1424 –1433 https://doi.org/10.1021/acsenergylett.2c02818 (2023). Google Scholar

80. 

C. She et al., “Machine learning-guided search for high-efficiency perovskite solar cells with doped electron transport layers,” J. Mater. Chem. A, 9 (44), 25168 –25177 https://doi.org/10.1039/D1TA08194B (2021). Google Scholar

81. 

W. Liu et al., “Machine learning enables intelligent screening of interface materials towards minimizing voltage losses for p-i-n type perovskite solar cells,” J. Energy Chem., 83 128 –137 https://doi.org/10.1016/j.jechem.2023.04.015 (2023). Google Scholar

82. 

S. Kim et al., “PubChem substance and compound databases,” Nucl. Acids Res., 44 (D1), D1202 –D1213 https://doi.org/10.1093/nar/gkv951 (2016). Google Scholar

83. 

Y. Hu et al., “Machine‐learning modeling for ultra‐stable high‐efficiency perovskite solar cells,” Adv. Energy Mater., 12 (41), 2201463 https://doi.org/10.1002/aenm.202201463 ADEMBC 1614-6840 (2022). Google Scholar

84. 

T. Bak et al., “Accelerated design of high-efficiency lead-free tin perovskite solar cells via machine learning,” Int. J. Precis. Eng. Manuf. Green Technol., 10 (1), 109 –121 https://doi.org/10.1007/s40684-022-00417-z (2023). Google Scholar

85. 

J. Wang et al., “Advancing vapor-deposited perovskite solar cells via machine learning,” J. Mater. Chem. A, 11 (25), 13201 –13208 https://doi.org/10.1039/D3TA00027C (2023). Google Scholar

86. 

N. Taherimakhsousi et al., “A machine vision tool for facilitating the optimization of large-area perovskite photovoltaics,” NPJ Comput. Mater., 7 (1), 190 https://doi.org/10.1038/s41524-021-00657-8 (2021). Google Scholar

87. 

K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” (2015). Google Scholar

88. 

S. Wang et al., “Carrier dynamics determines the optimization strategies of perovskite LEDs and PVs,” Research, 6 0112 https://doi.org/10.34133/research.0112 (2023). Google Scholar

89. 

F. Scarselli et al., “The graph neural network model,” IEEE Trans. Neural Netw., 20 (1), 61 –80 https://doi.org/10.1109/TNN.2008.2005605 ITNNEP 1045-9227 (2009). Google Scholar

Biography

Feiyue Lu received his master’s degree from the Institute of Advanced Materials and School of Flexible Electronics (Future Technologies), Nanjing Tech University (NanjingTech) in 2024. He currently focuses on the interdisciplinary research of machine learning and perovskite optoelectronic devices.

Lin Zhu received her PhD in chemistry from Nanjing University. She subsequently joined Nanjing Tech University, where her current research focuses on perovskite LEDs and machine learning.

Jianpu Wang received his PhD in organic semiconductor/inorganic nanocrystal devices from the Cavendish Laboratory at the University of Cambridge. He subsequently worked as a postdoctoral research associate at the Cavendish Laboratory from 2009 to 2013. His current research interests include organic and perovskite semiconductor devices, device physics, and artificial intelligence.

Biographies of the other authors are not available.

CC BY: © The Authors. Published by SPIE and CLP under a Creative Commons Attribution 4.0 International License. Distribution or reproduction of this work in whole or in part requires full attribution of the original publication, including its DOI.
Feiyue Lu, Yanyan Liang, Nana Wang, Lin Zhu, and Jianpu Wang "Machine learning for perovskite optoelectronics: a review," Advanced Photonics 6(5), 054001 (27 August 2024). https://doi.org/10.1117/1.AP.6.5.054001
Received: 23 May 2024; Accepted: 1 August 2024; Published: 27 August 2024
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
KEYWORDS
Perovskite

Data modeling

Machine learning

Optoelectronics

Optoelectronic devices

Light emitting diodes

Photovoltaics

Back to Top