1 min readOct 10, 2018
Great article! Much of the code uses Spark (which is naturally distributed using YARN or some other resource manager), but XGBoost’s Python binding doesn’t appear to be distributed. Does this mean that the actual training happens on only one node?