Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Save more on your purchases! discount-offer-chevron-icon
Savings automatically calculated. No voucher code required.
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletter Hub
Free Learning
Arrow right icon
timer SALE ENDS IN
0 Days
:
00 Hours
:
00 Minutes
:
00 Seconds
Arrow up icon
GO TO TOP
SQL Server 2017 Machine Learning Services with R

You're reading from   SQL Server 2017 Machine Learning Services with R Data exploration, modeling, and advanced analytics

Arrow left icon
Product type Paperback
Published in Feb 2018
Publisher Packt
ISBN-13 9781787283572
Length 338 pages
Edition 1st Edition
Languages
Arrow right icon
Authors (2):
Arrow left icon
Toma≈æ Ka≈°trun Kaštrun Toma≈æ Ka≈°trun Kaštrun
Author Profile Icon Toma≈æ Ka≈°trun Kaštrun
Toma≈æ Ka≈°trun Kaštrun
 Koesmarno Koesmarno
Author Profile Icon Koesmarno
Koesmarno
Arrow right icon
View More author details
Toc

Table of Contents (17) Chapters Close

Title Page
Copyright and Credits
www.PacktPub.com
Contributors
Preface
1. Introduction to R and SQL Server FREE CHAPTER 2. Overview of Microsoft Machine Learning Server and SQL Server 3. Managing Machine Learning Services for SQL Server 2017 and R 4. Data Exploration and Data Visualization 5. RevoScaleR Package 6. Predictive Modeling 7. Operationalizing R Code 8. Deploying, Managing, and Monitoring Database Solutions containing R Code 9. Machine Learning Services with R for DBAs 10. R and SQL Server 2016/2017 Features Extended 1. Other Books You May Enjoy Index

Index

A

  • Adventureworks database
    • URL / Importing data using ODBC
  • Air Flights data
    • URL / Testing rxLinMod performance on a table with a primary key
  • Analysis Of Variance (ANOVA) / Data modeling
  • analytical barriers / Analytical barriers
  • Area Under Curve (AUC) / Prerequisite – prepare the data
  • arguments, packages
    • @language / Arguments
    • @script / Arguments
    • @input_data_1 / Arguments
    • @input_data_1_name / Arguments
    • @output_data_1_name / Arguments
    • @parallel / Arguments
    • @params / Arguments
    • @parameter1 / Arguments
  • Azure Blob Storage
    • connectivity, enabling / Accessing external data sources using PolyBase

B

  • baseline
    • creating / Creating a baseline and workloads, and replaying
  • boxplot / Boxplot
  • built-in JSON capabilities
    • about / Built-in JSON capabilities
    • using / Built-in JSON capabilities
    • forecast package, using / Built-in JSON capabilities

C

  • ColumnStore
    • and in memory OLTP, used for performance testing / High performance using ColumnStore and in memory OLTP
  • continuous delivery
    • setting up, with VSTS / Setting up continuous delivery
  • continuous integration (CI)
    • setting up, in VSTS / Setting up continuous integration
  • CRAN
    • versus MRAN / Analytical barriers
  • custom reports
    • URL / Using custom reports for SQL Server R Services
    • using / Using custom reports for SQL Server R Services
    • R Services - Configuration.rdl / Using custom reports for SQL Server R Services
    • R Services - Packages.rdl / Using custom reports for SQL Server R Services
    • R Services - Resource Usage.rdl / Using custom reports for SQL Server R Services
    • R Services - Extended Events.rdl / Using custom reports for SQL Server R Services
    • R Services - Execution Statistics.rdl / Using custom reports for SQL Server R Services
    • adding / Adding the custom reports for the first time
    • viewing / Viewing an R Services custom report

D

  • data
    • gathering, for DBAs / Gathering relevant data
    • exploring / Exploring and analyzing data
    • analyzing / Exploring and analyzing data
  • data exploration
    • about / Data exploration and data munging
    • head(df) function / Exploring data in R
    • in R / Exploring data in R
    • tail(df) function / Exploring data in R
    • summary(df) function / Exploring data in R
    • names(df) function / Exploring data in R
    • str(df) function / Exploring data in R
    • describe(df$col) function / Exploring data in R
    • example / Example - data exploration and munging using R in T-SQL
  • data frames, R / Data frames in R
    • rows, adding / Adding/removing rows/columns in data frames
    • rows, removing / Adding/removing rows/columns in data frames
    • columns, adding / Adding/removing rows/columns in data frames
    • columns, removing / Adding/removing rows/columns in data frames
  • Data Mining eXpression (DMX) / Using R prior to SQL Server 2016
  • data modeling
    • about / Data modeling
    • URL, for Microsoft dataset / Data modeling
  • data munging
    • about / Data exploration and data munging
    • SQL Server data, importing into R / Importing SQL Server data into R
    • in R / Data munging in R
    • rows/columns, adding/removing in data frames / Adding/removing rows/columns in data frames
    • with dplyr / More data munging with dplyr
    • missing values, searching / Finding missing values
    • data, transposing / Transpose data
    • data, pivoting/unpivoting / Pivot / Unpivot data
    • example / Example - data exploration and munging using R in T-SQL
  • data preparation
    • data, importing from SAS / Data import from SAS, SPSS, and ODBC
    • data, importing from SPSS / Data import from SAS, SPSS, and ODBC
    • data, importing from ODBC / Data import from SAS, SPSS, and ODBC
  • data types, R
    • about / Understanding SQL and R data types
    • reference / Understanding SQL and R data types
  • data types, SQL / Understanding SQL and R data types
  • data visualization
    • in R / Data visualization in R
    • plot / Plot
    • histogram / Histogram
    • with boxplot / Boxplot
    • with scatter plot / Scatter plot
    • with tree diagram / Tree diagram
    • in T-SQL, with example / Example – R data visualization in T-SQL
  • DBAs
    • data, gathering / Gathering relevant data
  • disk usage
    • predicting, with R / Creating predictions with R - disk usage
  • DMVs
    • SQL Server Machine Learning Services, managing / Managing SQL Server Machine Learning Services with DMVs
    • system configuration / System configuration and system resources
    • system resources / System configuration and system resources
    • resource governor / Resource governor
  • DMVs, for resource governor
    • sys.resource_governor_resource_pools / Resource governor
    • sys.resource_governor_workload_groups / Resource governor
    • sys.resource_governor_external_resource_pools / Resource governor
    • sys.dm_resource_governor_external_resource_pool_affinity / Resource governor
  • DMVs, for system performance
    • sys.dm_exec_sessions / System configuration and system resources
    • sys.dm_os_performance_counters / System configuration and system resources
    • sys.dm_external_script_requests / System configuration and system resources
    • sys.dm_external_script_execution_stats / System configuration and system resources
  • dynamic-link library (DLLs) / Installing new R packages

E

  • EMS incidents dataset
    • URL / Built-in JSON capabilities
  • extensibility framework workloads / Extensibility framework workloads
  • eXternal Data Frame (XDF) format / Memory limitations

F

  • fast batch prediction
    • about / Fast batch prediction
    • prerequisites / Prerequisites
    • real-time scoring / Real-time scoring
    • native scoring / Native scoring
    • R prediction model, integrating / Step 1 – Train and save a real-time scoring model using T-SQL
    • real-time scoring model, training with T-SQL / Step 1 – Train and save a real-time scoring model using T-SQL
    • real-time scoring model, saving with T-SQL / Step 1 – Train and save a real-time scoring model using T-SQL
    • R prediction model, operationalizing with real-time scoring / Step 2a – Operationalize the model using real-time scoring
    • model, operationalizing with native scoring / Step 2b – Operationalize the model using native scoring
    • workloads / Fast batch prediction workloads
  • files
    • copying, for packages / Copying files
  • Filestream integration
    • reference / Integrating R code in reports and visualizations
  • forecast package
    • URL / Built-in JSON capabilities

G

  • GitHub Extension for Visual Studio
    • URL / Preparing your environment for the database lifecycle workflow

H

  • histogram / Histogram

I

  • in memory OLTP
    • and ColumnStore, used for performance testing / High performance using ColumnStore and in memory OLTP

L

  • leave one out (LOO) / Deploying and using predictive solutions

M

  • MeanDecreaseGini / Data modeling
  • Microsoft Machine learning R Server platform
    • about / The Microsoft Machine learning R Server platform
    • reference / The Microsoft Machine learning R Server platform
    • Microsoft R Open (MRO) / Microsoft R Open (MRO)
    • Microsoft Machine Learning R Server / Microsoft Machine Learning R Server
    • Microsoft SQL Server Machine Learning R Services / Microsoft SQL Server Machine Learning R Services
    • R Tools for Visual Studio (RTVS) / R Tools for Visual Studio (RTVS)
  • Microsoft Machine Learning R Services
    • architecture / The Microsoft Machine Learning R Services architecture
    • R, limitations / R Limitations
  • MRAN
    • about / Microsoft's commitment to the open source R language
    • versus CRAN / Analytical barriers

O

  • ODBC
    • data, importing / Importing data using ODBC

P

  • packages
    • information, obtaining / Package information
    • installing, with R Tools for Visual Studio (RTVS) / Using R Tools for Visual Studio (RTVS) 2015 or higher
    • R.exe, using in CMD / Using R.exe in CMD
    • XP_CMDSHELL, using / Using XP_CMDSHELL
    • files, copying / Copying files
    • rxInstallPackages function, using / Using the rxInstallPackages function
    • SQL Server R Services, managing with PowerShell / Managing SQL Server R Services with PowerShell
    • sp_execute_external_script external procedure / Getting to know the sp_execute_external_script external procedure
    • arguments / Arguments
  • performance testing
    • with ColumnStore and in memory OLTP / High performance using ColumnStore and in memory OLTP
    • rxLinMod performance, testing / Testing rxLinMod performance on a table with a primary key
    • reference / Comparing results
  • plot / Plot
  • PolyBase
    • used, for accessing external data sources / Accessing external data sources using PolyBase
    • prerequisites / Accessing external data sources using PolyBase
    • URL / Accessing external data sources using PolyBase
  • Power BI
    • R integration / Integrating R in Power BI
    • launching / Integrating R in Power BI
  • PowerShell
    • SQL Server R Services, managing / Managing SQL Server R Services with PowerShell
    • SQL Server prediction operations, executing / Executing SQL Server prediction operations via PowerShell
  • prediction operations
    • and R workloads, integrating / Integrating R workloads and prediction operations beyond SQL Server
    • SQL Server prediction operations, executing via PowerShell / Executing SQL Server prediction operations via PowerShell
    • scheduling / Scheduling training and prediction operations
    • training, scheduling / Scheduling training and prediction operations
    • R script, operationalizing as part of SSIS / Operationalizing R script as part of SSIS
  • predictions
    • performing, with R Services in SQL Server database / Performing predictions with R Services in the SQL Server database
    • creating, with R / Creating predictions with R - disk usage
  • predictive modeling
    • advanced predictive algorithms / Advanced predictive algorithms and analytics
    • advanced analytics / Advanced predictive algorithms and analytics
    • predictive solutions, deploying / Deploying and using predictive solutions
    • predictive solutions, using / Deploying and using predictive solutions
  • productionized model
    • accuracy, monitoring / Monitoring the accuracy of the productionized model
    • references / Useful references
  • Python Tools for Visual Studio (PTVS) / Arguments

R

  • R
    • using, prior to SQL Server 2016 / Using R prior to SQL Server 2016
    • Microsoft's commitment / Microsoft's commitment to the open source R language
    • integrating, with SQL Server for analytics / Boosting analytics with SQL Server R integration
    • data types / Understanding SQL and R data types
    • data frames / Data frames in R
    • SQL Server data, importing / Importing SQL Server data into R
    • data exploration / Exploring data in R
    • data munging / Data munging in R
    • data visualization / Data visualization in R
    • disk usage, predicting / Creating predictions with R - disk usage
  • R, limitations
    • performance issues / Performance issues
    • memory limitations / Memory limitations
    • security aspects / Security aspects
    • language syntax / Language syntax
    • overcomming / Overcomming R language limitations
  • R.exe
    • using, in CMD / Using R.exe in CMD
  • R code
    • operationalizing, with tools / Tools
  • real-time scoring
    • reference / Prerequisites
  • Receiver Operating Characteristic (ROC) curve / Prerequisite – prepare the data
  • reports
    • R integration / Integrating R code in reports and visualizations
  • Resource Governor / Resource Governor
  • RevoScaleR package
    • scalable environment / Scalable and distributive computational environments
    • distributive computational environment / Scalable and distributive computational environments
    • functions, for data preparation / Data import from SAS, SPSS, and ODBC
    • functions, for variable creation / Variable creation and data transformation, Variable creation and recoding
    • functions, for data transformation / Variable creation and data transformation
    • functions, for variable recoding / Variable creation and recoding
    • dataset, subsetting / Dataset subsetting
    • dataset, merging / Dataset merging
    • functions, for descriptive statistics / Functions for descriptive statistics
    • functions, for statistical tests / Functions for statistical tests and sampling
    • functions, for statistical sampling / Functions for statistical tests and sampling
    • URL / Advanced predictive algorithms and analytics
    • rxLinMod function / Advanced predictive algorithms and analytics
    • rxLogit function / Advanced predictive algorithms and analytics
    • rxGlm function / Advanced predictive algorithms and analytics
    • rxDTree function / Advanced predictive algorithms and analytics
    • rxBTrees function / Advanced predictive algorithms and analytics
    • rxDForest function / Advanced predictive algorithms and analytics
    • rxNaiveBayes function / Advanced predictive algorithms and analytics
  • R integration
    • in reports / Integrating R code in reports and visualizations
    • in visualizations / Integrating R code in reports and visualizations
    • in SSRS reports / Integrating R in SSRS reports
    • in Power BI / Integrating R in Power BI
  • RODBC
    • URL / Using R prior to SQL Server 2016
  • R packages
    • installing / Installing new R packages
    • URL / External packages
  • R prediction model
    • integrating / Integrating an existing R model
    • data, preparing / Prerequisite – prepare the data
    • training, with T-SQL / Step 1 – Train and save a model using T-SQL
    • saving, with T-SQL / Step 1 – Train and save a model using T-SQL
    • operationalizing / Step 2 – Operationalize the model
    • integrating, for fast batch prediction / Step 1 – Train and save a real-time scoring model using T-SQL
  • R Script Editor / Integrating R in Power BI
  • R Services
    • predictions, performing in SQL Server database / Performing predictions with R Services in the SQL Server database
  • RStudio
    • URL / R Tools for Visual Studio (RTVS)
  • R Tools for Visual Studio (RTVS)
    • about / R Tools for Visual Studio (RTVS), Data frames in R, Operationalizing R code with Visual Studio
    • URL / R Tools for Visual Studio (RTVS), Configuring the environment and installing R Tools for Visual Studio (RTVS), Using R Tools for Visual Studio (RTVS) 2015 or higher
    • installing / Configuring the environment and installing R Tools for Visual Studio (RTVS)
    • used, for installing packages / Using R Tools for Visual Studio (RTVS) 2015 or higher
    • reference / Using R Tools for Visual Studio (RTVS) 2015 or higher, Data exploration and data munging, Operationalizing R code with Visual Studio
  • R workloads
    • and prediction operation, integrating / Integrating R workloads and prediction operations beyond SQL Server
  • rxInstallPackages function
    • using / Using the rxInstallPackages function
  • rxLinMod performance
    • testing, on table with primary key / Testing rxLinMod performance on a table with a primary key
    • testing, on table with clustered ColumnStore index / Testing rxLinMod performance on a table with a clustered ColumnStore index
    • testing, on memory-optimized table with primary key / Testing rxLinMod performance on a memory-optimized table with a primary key
    • testing, on memory-optimized table with clustered ColumnStore index / Testing rxLinMod performance on a memory-optimized table with a clustered ColumnStore index
    • test results, comparing / Comparing results

S

  • SAS
    • URL / Data import from SAS, SPSS, and ODBC
    • data, importing / Importing SAS data
  • scatter plot / Scatter plot
  • security
    • managing / Security
    • Resource Governor / Resource Governor
    • R packages, installing / Installing new R packages
  • SPSS
    • URL / Data import from SAS, SPSS, and ODBC
    • data, importing / Importing SPSS data
  • SPSS Modeler
    • URL / Data import from SAS, SPSS, and ODBC
  • sp_execute_external_script external procedure / Getting to know the sp_execute_external_script external procedure
  • SQL Operations Studio
    • URL / Example – R data visualization in T-SQL, Data modeling, Built-in JSON capabilities
    • about / Preparing your environment for the database lifecycle workflow
  • SQL Server
    • R, integrating / Boosting analytics with SQL Server R integration
    • data types / Understanding SQL and R data types
    • data, importing into R / Importing SQL Server data into R
  • SQL Server 2016
    • URL / Analytical barriers
  • SQL Server 2017
    • requisites / Minimum requirements
    • URL / Minimum requirements
    • edition, selecting / Choosing the edition
    • installing / Choosing the edition
    • database, configuring / Configuring the database
    • environment, configuring / Configuring the environment and installing R Tools for Visual Studio (RTVS)
    • R Tools for Visual Studio (RTVS), installing / Configuring the environment and installing R Tools for Visual Studio (RTVS)
  • SQL Server Analysis Services (SSAS) / Using R prior to SQL Server 2016, Overcomming R language limitations
  • SQL Server database
    • predictions, performing with R Services / Performing predictions with R Services in the SQL Server database
  • SQL Server Database lifecycle workflow
    • R, integrating / Integrating R into the SQL Server Database lifecycle workflow
    • environment, preparing / Preparing your environment for the database lifecycle workflow
    • projects/solutions, coding / Preparing your environment for the database lifecycle workflow
    • projects/solutions, managing / Preparing your environment for the database lifecycle workflow
    • unit testing / Preparing your environment for the database lifecycle workflow
    • Version Control / Preparing your environment for the database lifecycle workflow
    • CI/CD / Preparing your environment for the database lifecycle workflow
    • prerequisites / Creating the SQL Server database project
    • database project, creating / Creating the SQL Server database project
    • existing database, importing into project / Importing an existing database into the project
    • stored procedure object, adding / Adding a new stored procedure object
    • schema changes, publishing / Publishing schema changes
    • unit test, adding against stored procedure / Adding a unit test against a stored procedure
  • SQL Server Integration Services (SSIS)
    • about / Integrating R workloads and prediction operations beyond SQL Server
    • R script, operationalizing / Operationalizing R script as part of SSIS
  • SQL Server Machine Learning Services
    • managing, with DMVs / Managing SQL Server Machine Learning Services with DMVs
  • SQL Server Management Studio (SSMS)
    • about / Integrating R in SSRS reports, Preparing your environment for the database lifecycle workflow
    • using / Using SSMS as part of operationalizing R script
    • URL / Using SSMS as part of operationalizing R script
    • custom reports, using for SQL Server R Services / Using custom reports for SQL Server R Services
    • SQL Server Machine Learning Services, managing with DMVs / Managing SQL Server Machine Learning Services with DMVs
  • SQL Server Reporting Services (SSRS)
    • about / Integrating R code in reports and visualizations
    • reference / Integrating R in SSRS reports
  • SQL Server R Services
    • managing, with PowerShell / Managing SQL Server R Services with PowerShell
  • SQL Server Unit Test in Visual Studio
    • URL / Preparing your environment for the database lifecycle workflow
  • SSRS reports
    • R integration / Integrating R in SSRS reports
  • SSRS Report Server / Integrating R in SSRS reports

T

  • T-SQL
    • R data visualization / Example – R data visualization in T-SQL
  • Tablix / Integrating R in SSRS reports
  • Team Foundation Server (TFS)
    • URL / Preparing your environment for the database lifecycle workflow
  • Team Foundation Version Control (TFVC) / Preparing your environment for the database lifecycle workflow
  • tools
    • for operationalizing R code / Tools
    • SQL Server Management Studio (SSMS), using / Using SSMS as part of operationalizing R script
    • Visual Studio, using / Operationalizing R code with Visual Studio
  • tree diagram / Tree diagram
  • tSQLt
    • URL / Preparing your environment for the database lifecycle workflow

V

  • version control
    • using / Using version control
  • visualizations
    • R integration / Integrating R code in reports and visualizations
  • Visual Studio
    • about / Integrating R in SSRS reports
    • used, for operationalizing R code / Operationalizing R code with Visual Studio
    • URL / Operationalizing R code with Visual Studio, Preparing your environment for the database lifecycle workflow
    • URL, for VSTS project / Using version control
  • Visual Studio Team Services (VSTS)
    • URL / Preparing your environment for the database lifecycle workflow
    • continuous integration (CI), setting up / Setting up continuous integration
    • build definition, creating / Creating a build definition in VSTS
    • build, deploying to local SQL Server instance / Deploying the build to a local SQL Server instance
    • test phase, adding to build definition / Adding the test phase to the build definition
    • build, automating for CI / Automating the build for CI
    • continuous delivery, setting up / Setting up continuous delivery
    • reference / Setting up continuous delivery

W

  • WideWorldImporters database
    • URL / Data exploration and data munging
  • workloads
    • roles, managing / Managing roles and permissions for workloads
    • permissions, managing / Managing roles and permissions for workloads
    • extensibility framework workloads / Extensibility framework workloads
    • fast batch prediction workloads / Fast batch prediction workloads
    • external packages / External packages
    • creating / Creating a baseline and workloads, and replaying
    • replaying / Creating a baseline and workloads, and replaying
  • write-ahead log (WAL) / Gathering relevant data

X

  • XDT Transform
    • about / Setting up continuous delivery
    • URL / Setting up continuous delivery
  • XP_CMDSHELL
    • using / Using XP_CMDSHELL
lock icon The rest of the chapter is locked
arrow left Previous Section
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime
Visually different images