Have a question?
Message sent Close
4.55 out of 5
4.55
26 reviews on Udemy

Prediction Mapping Using GIS Data and Advanced ML Algorithms

eXtreme Gradient Boosting, K Nearest Neighbour, Naïve Bayes, Random Forest for Prediction Geo-Hazards and Air pollution
Instructor:
Dr. Omar AlThuwaynee
186 students enrolled
English [Auto]
Two applications of susceptibility prediction mapping in GIS, 1) Landslides prediction maps 2) Ambient air pollution prediction maps
Step by step analysis of machine learning algorithms for classification: eXtreme Gradient Boosting (XGBoost) K nearest neighbour (KNN) Naïve Bayes (NB) Random forest (RF)
Run classification based algorithms with training data model accuracy, Kappa index, variables importance, sensitivity analysis of explanatory and response data
Hyper-parameter optimization procedure and application
Model accuracy test and validation using; confusion matrix and results validation using AUC value under ROC plot
Produce prediction maps using Raster and vector dataset

In this course, four machine learning supervised classification based techniques used with remote sensing and geospatial resources data to predict two different types of applications:

     Project 1: Data of Multi-labeled target prediction via multi-label classification (multi class problem). Target (Y) that has 3 labeled classes (instead of Numbers): Names, description, ordinal value (small, large, X-large)..Multiple output maps. Like:

  • Increase specific type of species in certain areas and its relationship with surrounding conditions.

  • Air pollution limits prediction (Good, moderate, unhealthy, Hazardous..)

  • Complex diseases types: potential risk factors and their effects on the disease are investigated to identify risk factors that can be used to develop prevention or intervention strategies.

  • Course application:  Prediction of concentration of particulate matter of less than 10 µm diameter (PM10)

    Project 2: Data of Binary labeled target prediction. Target with 2 classes: Yes and No, Slides and No slide, Happened –Not happened, Contaminated- Clean.

  • Flooded areas and it contribution factors like topographic and climate data.

  • Climate change related consequences and its dragging factors like urban heat islands and it relationship with land uses.

  • Oil spills: polluted and non polluted.

  • Course application:Landslide susceptibility mapping in prone area.

  • If you are previously enrolled in my previous course using ANN, then you have the chance to compare the outcomes, as we used the same landslide data here.

Eventually, all the measured data (training and testing), were used to produce the prediction map to be used in further GIS analysis or directly to be presented to decision makers or writing research article in SCI journals.

This course considered the most advanced, in terms of analysis models and output maps that successfully invested in the (1) machine learning algorithm and geospatial domains; (2) free available data of remote sensing in data scarce environment.

Introduction and Course Content : Get to know what will we talk about!

1
Course contents
2
Course applications: Landslide and Air pollution prediction analysis
3
Projects data, study areas and applications extent
4
Expected outcomes: What will we achieve together!

Practical summary about the classification based machine learning algorithms!

1
CARET package in R
2
Hyperparameters optimization (model tuning) in machine learning
3
eXtreeme Gradient Boosting (XGBoost) classifier machine learning
4
K- nearest neighbors (KNN) classifier machine learning
5
Naive Bayes (NB) classifier machine learning
6
Ensemble Random forest (RF) classifier machine learning
7
Selection of training and testing data concept
8
Current computer and software's specifications that used in the course

Project 1: PM10 prediction mapping : Data record pre-processing and data entry

1
PM10 readings pre-processing and input data preparation in Excel
2
Allocate the air monitoring stations and record data entry in QGIS
3
PM10 readings conversion to WHO limits in QGIS

Project 1 PM10 prediction mapping : Input data-frame processing and production

1
Preparation of PM10 prediction remote sensing variables data-list
2
Landsat 8 imagery download
3
Visualization of downloaded Landsat 8 images
4
Processing of Landsat 8 bands and indices in R
5
Processing of Land Surface Temperature (LST) in R
6
Processing of average monthly and annual Landsat 8 bands and indices in R
7
Processing and production of road networks variable in QGIS
8
Preparation of input dataframe (target and conditioning factors) in QGIS
9
Finalize input variables and convert it to CSV format file in QGIS for modeling

Project 1 PM10 prediction mapping : modeling of advanced ML classifiers in R

1
XGBoost algorithm: Data entry and visualization in R
2
XGBoost algorithm: Run of train default function
3
XGBoost algorithm: Hyper-parameter optimization and plot (model tuning)
4
XGBoost algorithm: AUC value of ROC plot
5
XGBoost algorithm: Fit optimized model using all inventory observations
6
XGBoost algorithm: Conversion to dataframe and scaling of Raster images
7
XGBoost algorithm: Probability prediction maps production
8
XGBoost algorithm: Classification prediction maps production
9
NB algorithm: ggplot of linearity between target and independents and variables
10
NB algorithm: Run of train default function
11
NB algorithm: Hyper-parameter optimization, AUC of ROC plot & normalized Rasters
12
NB algorithm: Probability and classification prediction maps production
13
KNN algorithm: Run of train function and hyper-parameter optimized models
14
KNN algorithm: AUC of ROC and probability and classification prediction maps
15
RF algorithm: Data entry and train function using Grid search tuning
16
RF algorithm: train function using Random search tuning and AUC of ROC
17
RF algorithm: Scaling and conversion of raster images to dataframe
18
RF algorithm: Probability prediction map
19
RF algorithm: Classification prediction map
20
Summary and Visualization of 4 algorithms prediction resultant maps in QGIS

Project 2 Landslide : Create training and testing data in QGIS

1
Adding my developed tools to QGIS processing library
2
Create Land Cover map (convert string observations to numeric) in QGIS
3
Run the tools Step 1
4
Run the tools Step 2
5
Run the tools Step 3

Project 2 Landslide prediction mapping : pre-processing training data in Excel

1
Excel work step 1
2
Excel work step 2

Project 2 Landslide prediction mapping : modeling of advanced ML classifiers

1
XGBoost algorithm : Training and testing data entry in R
2
XGBoost algorithm : Run train function using default settings
3
XGBoost algorithm: Hyper-parameter optimization (model tuning) and pairs plot
4
XGBoost algorithm: AUC of ROC plot and important technical error
5
XGBoost algorithm: Run optimized model and probability prediction maps
6
XGBoost algorithm: Classification prediction map production
7
KNN algorithm : Data entry and visualization of target and other variables
8
KNN algorithm: Run of train function and hyper-parameter optimized models
9
KNN algorithm: AUC of ROC plot and technical issues with data entry
10
KNN algorithm: probability prediction maps
11
KNN algorithm: classification prediction map
12
NB algorithm: Training data entry and visualization of variables
13
NB algorithm: Train function and Hyper-parameters and AUC of ROC plot
14
NB algorithm: Probability and classification prediction maps production
15
RF algorithm: Data entry of training data variables
16
RF algorithm: default train function and Hyper-parameter and AUC of ROC plot
17
RF algorithm: Probability and classification prediction maps
18
Summary and Visualization of 4 algorithms prediction maps in QGIS

Projects Conclusion and main remarks of the presented course

1
Summary: Let us sum up everything and recap what we discussed earlier!
You can view and review the lecture materials indefinitely, like an on-demand channel.
Definitely! If you have an internet connection, courses on Udemy are available on any device at any time. If you don't have an internet connection, some instructors also let their students download course lectures. That's up to the instructor though, so make sure you get on their good side!
4.6
4.6 out of 5
26 Ratings

Detailed Rating

Stars 5
12
Stars 4
11
Stars 3
1
Stars 2
1
Stars 1
1
edfe9a5f2363badc9220a2f2fdf43935

Includes

16 hours on-demand video
Certificate of Completion

About

AulaGEO is a Ge-engineering specialized academy.

Social Network

Udemy Black Friday Sale! Top Courses From $9.99