Statistics | Regression | Machine learning

Data projects with the evaluation built in.

I am Lei Duli, a statistics student at the University of Arizona. I build reproducible analysis projects that connect modeling choices, diagnostics, and real-world interpretation.

48,895

Airbnb listings analyzed

0.507

best adjusted R2

0.922

best ROC AUC

41.9%

high-intent conversion rate

Featured work

Selected Projects

Course projects turned into portfolio pieces: clear questions, reproducible code, model comparison, diagnostics, and reports.

Data preprocessing

Large-Scale Data Preprocessing and Feature Discretization

Prepared a used-car transaction dataset through missing value handling, outlier detection, discretization, and encoding.

Python pandas scikit-learn Feature engineering

Research synthesis

Heuristic Optimization of LSTM-Based Models

Reviewed PSO, GA, SA, and ACO optimization strategies for LSTM-based forecasting across applied time-series domains.

LSTM PSO GA Time series

Technical toolkit

What I work with

Languages

R, Python, SQL

Data work

EDA, cleaning, feature engineering, visualization, reporting

Modeling

OLS, GLM, logistic regression, LASSO, PCA, random forest, gradient boosting, SVM

Evaluation

Cross-validation, diagnostics, ROC AUC, F1, precision, recall, balanced accuracy