2015 HortonWorks MDA Roadshow Presentation

18
Copyright © 2015, SAS Institute Inc. All rights reserved. Big Data Analytics with SAS and Hadoop Felix Liao Business Solutions Manager SAS Australia/New Zealand

Transcript of 2015 HortonWorks MDA Roadshow Presentation

Page 1: 2015 HortonWorks MDA Roadshow Presentation

Copyright © 2015, SAS Institute Inc. All rights reserved.

Big Data Analytics with SAS and HadoopFelix LiaoBusiness Solutions ManagerSAS Australia/New Zealand

Page 2: 2015 HortonWorks MDA Roadshow Presentation

Copyr igh t © 2014, SAS Ins t i tute Inc . A l l r i gh ts r es erved.

Agenda

5 things you didn’t know about SAS (and Hadoop)

Page 3: 2015 HortonWorks MDA Roadshow Presentation

Copyr igh t © 2014, SAS Ins t i tute Inc . A l l r i gh ts r es erved.

#1 SAS is the largest private software company in the world

1000+ customer sites in Australia & New Zealand

A market leader in the areas of Data Management, Reporting and Advanced Analytics

23% annual re-investment in R&D

Page 4: 2015 HortonWorks MDA Roadshow Presentation

Copyr igh t © 2014, SAS Ins t i tute Inc . A l l r i gh ts r es erved.

#2 SAS has been doing machine learning for 39 years

SAS is the "800-pound gorilla" in the analytics space

- Gartner

Page 5: 2015 HortonWorks MDA Roadshow Presentation

Copyr igh t © 2014, SAS Ins t i tute Inc . A l l r i gh ts r es erved.

Breadth and Depth of Analytical CapabilitiesAppend

Data

PartitionFile

Import Filter Merge SampleSAMPLE

Association DMDB

MultiPlotEXPLORE Graph

ExploreLink Analysis

Path AnalysisSOM/Kohonen

StatExploreVariable

ClusteringVariable

SelectionMarket Basket

Cluster

MODIFY DropRules Builder

ReplacementPrincipal Components

Interactive BinningImpute

Transform Variables

Decision Tree

AutoNeural Neural NetworkRegression

Partial Least Squares

Dmine Regression

MODEL

DM Neural

Ensemble

Rule Induction

Gradient Boosting

LARS

MBR

Two Stage

Model Import

Incremental Response

Survival Analysis

Credit Scoring*

TS Correlation

TS Data Prep

TS Dimension Reduction

TS Decomp.

TS Similarity

TS Exponential Smoothing

HP Explore

HP ImputeHP

RegressionHP

TransformHP Variable Selection

HP Neural

HP Forest

HP Decision Tree

HP Data Partition

HP GLM HP Cluster

HP Principal ComponentsHP SVM

Cutoff Segment ProfileASSESS Model Comparison

ScoreDecisions

UTILITY Control Point

MetadataSAS Code

ReporterEnd Groups Score Code

ExportStart Groups Ext Demo

Input

Data

Open Source Integration

Register Metadata

Save Data

Page 6: 2015 HortonWorks MDA Roadshow Presentation

Copyr igh t © 2014, SAS Ins t i tute Inc . A l l r i gh ts r es erved.

#3 SAS is serious and committed about Hadoop

Hadoop as catalyst for big data analytics Bringing SAS analytics to Hadoop Joint R&D effort with leading Hadoop vendors

Page 7: 2015 HortonWorks MDA Roadshow Presentation

Copyr igh t © 2014, SAS Ins t i tute Inc . A l l r i gh ts r es erved.

Open Data Platform Initiative

SAS is a founding member of the open data platform (ODP) initiative

Accelerate innovations around a stable common core platform

Maximize big data adoption and productivity

Page 8: 2015 HortonWorks MDA Roadshow Presentation

Copyr igh t © 2014, SAS Ins t i tute Inc . A l l r i gh ts r es erved.

#4 SAS is a certified workload engine on YARN

We are very excited today to announce the next step in our joint journey

achieved by integrating SAS HPA and LASR with the YARN resource manager

so it will run as a first class citizen in the Hadoop cluster, co-existing and sharing

cluster resources with other YARN enabled workloads running Hadoop and third-party YARN enabled applications.

Arun C. Murthy

Page 9: 2015 HortonWorks MDA Roadshow Presentation

Copyr igh t © 2014, SAS Ins t i tute Inc . A l l r i gh ts r es erved.

Page 10: 2015 HortonWorks MDA Roadshow Presentation

Copyr igh t © 2014, SAS Ins t i tute Inc . A l l r i gh ts r es erved.

SAS & Hadoop Accelerating the Analytical Life Cycle

Prepare data IN Hadoop for analytics

Deploy and manage model score code IN

Hadoop

Lift data IN to memory for analytics at scale

Model data at scale in-memory WITH advanced

modeling tools

Explore data at scale, in-memory WITH data

visualization

Page 11: 2015 HortonWorks MDA Roadshow Presentation

Copyr igh t © 2014, SAS Ins t i tute Inc . A l l r i gh ts r es erved.

Prepare Hadoop Data: SAS Data Loader for Hadoop

Page 12: 2015 HortonWorks MDA Roadshow Presentation

Copyr igh t © 2014, SAS Ins t i tute Inc . A l l r i gh ts r es erved.

Hadoop Data Discovery: SAS Visual Analytics

Page 13: 2015 HortonWorks MDA Roadshow Presentation

Copyr igh t © 2014, SAS Ins t i tute Inc . A l l r i gh ts r es erved.

Model Development: SAS In-Memory Statistics for Hadoop

Page 14: 2015 HortonWorks MDA Roadshow Presentation

Copyr igh t © 2014, SAS Ins t i tute Inc . A l l r i gh ts r es erved.

#5 SAS is delivering big data analytics today!

Now we can run hundreds and thousands of models at the product level - at the SKU level

- because you have the big data and analytics to support those models at that

level.

- Kerem Tomak (VP of Analytics)

We have a lot of data, but now we can start unleashing the power of that information

- Joanna Gurry (Head of Information)

Page 15: 2015 HortonWorks MDA Roadshow Presentation

Copyr igh t © 2014, SAS Ins t i tute Inc . A l l r i gh ts r es erved.

SAS and Hortonworks - Rogers Media

40 million records per month in Hortonworks HDP

More than 600 relevant web characteristics Processing data on 12 million customers SAS High Performance Analytics to place

better targeted ads “Several of us from Rogers in the room looked at each

other, and said ‘That is really wicked; that’s cool.”

Chris Dingle

Senior Director of Audience Solutions

Rogers Communications

https://www.youtube.com/watch?v=YFtrK02VaM4

Page 16: 2015 HortonWorks MDA Roadshow Presentation

Copyr igh t © 2014, SAS Ins t i tute Inc . A l l r i gh ts r es erved.

Five things you now know about SAS and Hadoop! #1 SAS is the largest private software company in the world #2 SAS has been doing machine learning for 39 years #3 SAS is serious and committed about Hadoop #4 SAS is a certified workload engine on YARN #5 SAS is delivering big data analytics today

Page 17: 2015 HortonWorks MDA Roadshow Presentation

Copyr igh t © 2014, SAS Ins t i tute Inc . A l l r i gh ts r es erved.http://www.sas.com/au/sashadoop

Page 18: 2015 HortonWorks MDA Roadshow Presentation

Copyr igh t © 2012, SAS Ins t i tute Inc . A l l r i gh ts r es erved.

[email protected]

@felixliao

felixliao Thank You!

http://www.sas.com/au/sashadoop