Yang Liu

Posts

All migrated research notes and technical posts.

2025 SuperNationals VIII: A Once-in-Four-Years Chess Spectacle

Blog

Participation, section design, rating distributions, and standing patterns from the 2025 SuperNationals VIII chess tournament in Orlando.

May 15, 2025
Yang Liu

2024 National K-12: Does a Higher Rating Lead to a Higher Standing?

Blog

A look at participation, rating distributions, and the relationship between rating and final standing at the 2024 National K-12 Grades Championship.

Dec 15, 2024
Yang Liu

How many children under 5 worldwide?

Data Visualization

A compact reference for global age-group population totals from the UN World Population Prospects 2024 mid-year estimates.

Dec 13, 2024
Yang Liu

Plot multiple countries on the world map

Data Visualization
Spatial Data

How to plot administrative areas from several countries in R, and how to avoid stray connecting lines caused by non-unique polygon group IDs.

May 29, 2023
Yang Liu

R Markdown to GitHub Pages

GitHub

A concise legacy R Markdown workflow for publishing small GitHub Pages sites, now preserved with a Quarto-era maintenance note.

Jan 6, 2021
Yang Liu

US and China Admin1 COVID-19 mortality and incidence rate

Data Visualization

R examples for mapping COVID-19 mortality and incidence rates across US states and Chinese provinces, with a comparison to lung cancer maps.

Jun 14, 2020
Yang Liu

ggplot US state and China province heatmap

Data Visualization

R examples for drawing US state and China province heatmaps with ggplot2, usmap, shapefiles, labels, scale bars, and leaflet-based alternatives.

Jun 12, 2020
Yang Liu

Working with 3D array as long-format data in R

Data Engineering

A compact R example for melting a 3D array into long-format data, modifying it, and rebuilding the array in the correct dimension order.

Jan 11, 2020
Yang Liu

Drake: powerful tool for automatic reproducible workflow

Reproducible Workflow

A legacy R workflow note showing how drake can cache calculations and speed up R Markdown report rendering, with a note on modern targets workflows.

Sep 15, 2019
Yang Liu

Notes on writing an R package

R package

Practical notes from preparing the SHAPforxgboost R package for CRAN, including DESCRIPTION, NAMESPACE, documentation, checks, and reviewer feedback.

Jul 28, 2019
Yang Liu

SHAP for XGBoost in R: SHAPforxgboost

R package
Machine Learning
Data Visualization

A vignette-style guide to the SHAPforxgboost R package, covering SHAP values, global importance, dependence plots, interaction effects, and force plots.

Jul 18, 2019
Yang Liu

Shiny in Blogdown

Data Visualization
Shiny

A short R note on embedding a hosted Shiny app inside a static blog page using an iframe or knitr helper.

Feb 24, 2019
Yang Liu

Study shrinkage and DART in xgboost modeling using a simple dataset

Machine Learning

A small R experiment that illustrates how shrinkage and DART dropout change the behavior of XGBoost trees on a simple toy dataset.

Nov 15, 2018
Yang Liu

SHAP Visualization in R (first post)

Machine Learning
Data Visualization

The original R post on SHAP visualizations for XGBoost, now linked to the later SHAPforxgboost package vignette.

Oct 14, 2018
Yang Liu

Spatial data in R: Dividing raster layers into equal-area rings

Data Visualization
Spatial Data

An R spatial workflow for turning raster layers into equal-area rings, calculating urban saturation, and plotting city comparisons on a shared scale.

Jul 20, 2018
Yang Liu

How to Draw Heatmap with Colorful Dendrogram

Data Visualization

An R visualization example that builds a clustered heatmap with colorful dendrogram branches and labels using gplots, dendextend, and colorspace.

Jul 16, 2018
Yang Liu

eXtreme Gradient Boosting (XGBoost): Better than random forest or gradient boosting

Machine Learning

A hands-on R comparison of XGBoost, gradient boosting, random forest, lasso, and best subset regression on a slum-settlement modeling example.

Jul 9, 2018
Yang Liu

Catalog of my old blog

Catalog

A historical catalog of posts from my earlier Blogger site, with links to older work on statistics, data visualization, R programming, and urban research.

Jul 1, 2018
Yang Liu
2026 Yang Liu

Built with Quarto and R