Sam Lightstone
Distinguished Engineer
© 2015 IBM Corporation
The Science and Engineering Behind Super Fast Load-and-Go In-memory Analytics
IBM Analytics Platform Group
BLU Acceleration
1.  Next generation in-memory SQL data warehousing
•  Super Fast (query performance)
•  Super Simple (load-and-go)
•  Super Small (RAM and storage savings)
2.  Seamlessly integrated
•  Built seamlessly into both dashDB (cloud) and DB2 (software)
•  Consistent SQL, language interfaces, administration
•  Dramatic simplification
3.  Hardware optimized
•  Memory optimized
•  CPU optimized
•  I/O optimized
35X-73X faster
…than traditional row-organized relational
database technology, on average.
“Compared to our current production system, DB2 10.5 with BLU Acceleration is running 106x
faster for our Admissions and Enrollment workloads. We had one query that we would often
cancel if it didn’t finish in 30 minutes. Now it runs in 56 seconds every time. 32x faster,
predictable response time, no tuning…what more could we ask for?”
- Brenda Boshoff, Sr. DBA
“Wow…unbelievable speedup in query run times! We saw a speedup of 273x
in our Vehicle Tracking report, taking a query from 10 minutes to 2.2 seconds.
That adds value to our business; our end users are going to be ecstatic!”
- Ruel Gonzalez - Information Services
DataProxy LLC
“My largest row-organized, adaptive compressed table gave me 3.2x storage savings. However,
converting this row-organized uncompressed table to a column-organized table in DB2 10.5
delivered a massive 15.4x savings!”
- Iqbal Goralwalla, Head of DB2 Managed Services, Triton
Records: 76M
Columns: 61
Indexes: 10
Load time: Row-unc 15:39:10, Col 1:10:29
13.5x faster load time
Storage savings: 31.5x (97% less storage required)
Super simple
CREATE. LOAD. GO!
Risk system injects 1/2 TB per night
from 25 different source systems.
“Impressive Load times.”
Some queries achieved an almost
100x speed up with literally no
tuning.
6 hours: installing BLU to query results.
One of the world’s most profitable
and secure rated banks.
Load-and-go simplicity
•  No indexes
•  No storage reclaim (it’s automated)
•  No memory configuration (it’s automated)
•  No process model configuration (it’s automated)
•  No statistics collection (it’s automated)
•  No MDC or MQTs
•  No statistical views
•  No optimizer profiles/guidelines
“The BLU Acceleration technology has some obvious benefits: … But it’s when I think about
all the things I don't have to do with BLU, it made me appreciate the technology even
more: no tuning, no partitioning, no indexes, no aggregates.”
-Andrew Juarez, Lead SAP Basis and DBA
Simple.
•  LOAD and then… run queries
In-memory speeds with tiny RAM requirements
•  Rule of thumb: RAM requirements are just 5% of source data size.
•  Example:
•  10 terabytes of raw user data
•  500 GB of RAM
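The rule of thumb above can be checked with a few lines of arithmetic. This is only a sketch: the function name `estimate_ram_gb` is made up for illustration, and it assumes decimal units (1 TB = 1,000 GB), which is what the slide's round numbers imply.

```python
def estimate_ram_gb(raw_data_gb, percent_of_source=5):
    """Estimate RAM needed, as a percentage of the raw (uncompressed) data size.

    The 5% default is the rule of thumb quoted on the slide; real workloads vary.
    """
    return raw_data_gb * percent_of_source / 100


# 10 TB of raw user data, expressed in GB (decimal units assumed).
ram_gb = estimate_ram_gb(10_000)
print(f"Estimated RAM: {ram_gb:.0f} GB")
```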
Magic: How did they do that?
BLU Acceleration
Easiest. Fastest.
Smallest.
IBM Means
Innovation
BLU Acceleration includes over 30 new patents and patents pending from IBM Research & Development Laboratories.
Looking at Disk Improvements
(courtesy of D. DeWitt, PASS Summit Keynote 2009)
•  Incredibly inexpensive drives (& processors) have made it
possible to collect, store, and analyze huge quantities of data
But, consider the metric transfer bandwidth/byte
Over the last 30 years
Capacity: 80 MB → 800 GB (10,000x)
Transfer rates: 1.2 MB/sec → 80 MB/sec (65x)
•  1980: 1.2 MB/sec / 80 MB = 0.015
•  2009: 80 MB/sec / 800,000 MB = 0.0001
When relative capacities are factored in, drives are 150X slower today!!!
We’ve been snookered!
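The bandwidth-per-byte argument is plain arithmetic, so it can be reproduced directly. The sketch below uses only the figures quoted on the slide; the function name `bandwidth_per_byte` is made up for illustration.

```python
def bandwidth_per_byte(transfer_mb_per_sec, capacity_mb):
    """Fraction of the drive that can be transferred in one second."""
    return transfer_mb_per_sec / capacity_mb


ratio_1980 = bandwidth_per_byte(1.2, 80)        # 1980 drive: 1.2 MB/sec, 80 MB
ratio_2009 = bandwidth_per_byte(80, 800_000)    # 2009 drive: 80 MB/sec, 800 GB

# How much slower a full-drive read has become relative to capacity.
relative_slowdown = ratio_1980 / ratio_2009

print(f"1980: {ratio_1980:.4f}  2009: {ratio_2009:.4f}")
print(f"Relative slowdown: about {relative_slowdown:.0f}x")
```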
  
Source: http://www.jcmit.com/mem2013.htm
Main Memory
CPU cache optimized
• RAM is at the bottom of the memory
hierarchy. It is the slowest non-persistent
memory in a server.
• CPU cache is many times faster than
RAM.
• Extreme re-engineering of database
algorithms to be CPU cache optimized
• BLU algorithms adapt automatically to
hardware cache size.
[Chart: speed-up versus RAM. RAM 1x, L3 15x, L2 41x, L1 173x]
Big Idea: RAM is Too Slow
More Evil Than Ever
1. Human intervention
2. Voluminous I/O
3. Random I/O
4. Memory stalls
5. Single core processing
6. The cost of RAM
Load-and-go simplicity: System resources
[Diagram: multiple CPUs and RAM]
1. Auto-detect and adapt to available RAM
2. Auto-detect and adapt to core number and type
3. Auto-detect and adapt to CPU cache size
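A minimal sketch of what adapting to cores and cache size could look like. This is not BLU's actual logic: the function names, the rule of using half the cache for a chunk, and the cache size passed in are all assumptions made for illustration. Only `os.cpu_count()` is a real detection call here; the cache size is supplied by the caller.

```python
import os


def rows_per_chunk(cache_bytes, value_bytes, fraction=0.5):
    """Pick a chunk size so one chunk of values fits in part of the CPU cache."""
    return max(1, int(cache_bytes * fraction) // value_bytes)


def plan_scan(total_rows, cache_bytes, value_bytes, cores=None):
    """Split a column scan across cores and size its chunks to the cache."""
    cores = cores or os.cpu_count() or 1     # auto-detect when not given
    rows_per_core = -(-total_rows // cores)  # ceiling division
    return {
        "cores": cores,
        "rows_per_core": rows_per_core,
        "rows_per_chunk": rows_per_chunk(cache_bytes, value_bytes),
    }


# Hypothetical machine: 8 cores, 256 KB of cache per core, 4-byte values.
plan = plan_scan(1_000_000, cache_bytes=256 * 1024, value_bytes=4, cores=8)
print(plan)
```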
Load-and-go simplicity: Automatic Workload Management
• Built-in and automated query resource consumption control
• Many queries can be submitted, but effective concurrency, per query RAM,
and CPU are automatically and dynamically controlled
[Diagram: applications and users submit up to tens of thousands of SQL queries at once to the BLU runtime; a moderate number of queries consume resources]
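The idea of accepting many queries while letting only a few run at once can be sketched with a semaphore. This illustrates admission control in general, not the BLU runtime; the class `AdmissionGate` and the limits used are hypothetical.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class AdmissionGate:
    """Accept any number of queries, but let only max_active run at once."""

    def __init__(self, max_active):
        self._slots = threading.Semaphore(max_active)
        self._lock = threading.Lock()
        self.active = 0
        self.peak = 0   # highest concurrency actually observed

    def run(self, query_fn):
        with self._slots:                 # wait here until a slot is free
            with self._lock:
                self.active += 1
                self.peak = max(self.peak, self.active)
            try:
                return query_fn()
            finally:
                with self._lock:
                    self.active -= 1


def submit_all(gate, queries, submitters=32):
    """Submit every query from many client threads; results keep input order."""
    with ThreadPoolExecutor(max_workers=submitters) as pool:
        return list(pool.map(gate.run, queries))


gate = AdmissionGate(max_active=4)
results = submit_all(gate, [lambda i=i: i * i for i in range(200)])
print(f"{len(results)} queries ran, peak concurrency {gate.peak}")
```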
Load-and-go simplicity: Automatic Space Reclaim
•  Automatic space reclamation
•  Frees extents with no active values
•  The storage can be subsequently reused by any
table in the table space
•  No need for costly DBA space management and
REORG utility
•  Space is freed online while
work continues
•  Regular space management can result in
increased performance of RUNSTATS and some
queries
[Diagram: storage extents for Column 1, Column 2, and Column 3 holding 2012 and 2013 values]
DELETE FROM MyTable
WHERE Year = 2012
These extents hold only deleted data
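The reclaim rule on this slide (free an extent once it holds no active values) can be sketched as follows. The `Extent` class and both functions are made up for illustration and say nothing about how DB2 tracks extents internally.

```python
from dataclasses import dataclass, field


@dataclass
class Extent:
    """A block of storage holding values for one column (here, years)."""
    years: list
    deleted: list = field(default_factory=list)

    def __post_init__(self):
        if not self.deleted:
            self.deleted = [False] * len(self.years)

    def has_active_values(self):
        return not all(self.deleted)


def delete_where_year(extents, year):
    """Mark matching rows as deleted; storage is not freed yet."""
    for extent in extents:
        for i, value in enumerate(extent.years):
            if value == year:
                extent.deleted[i] = True


def reclaim(extents):
    """Return the extents still in use and the number of extents freed."""
    kept = [e for e in extents if e.has_active_values()]
    return kept, len(extents) - len(kept)


extents = [Extent([2012] * 4), Extent([2012, 2013, 2013, 2013]), Extent([2013] * 4)]
delete_where_year(extents, 2012)
kept, freed = reclaim(extents)
print(f"freed {freed} extent(s), {len(kept)} still in use")
```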
•  No CPU for decompression
•  Data flows through memory and CPU at compressed size
•  “Actionable Compression”
•  10-20X smaller is common
•  Compress as small as 1 bit
•  Compress the most frequent data the smallest
•  Honey, I shrunk the data: BIG → tiny
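To illustrate "compress the most frequent data the smallest", the sketch below computes classic Huffman code lengths for a column. Huffman coding is a stand-in chosen for illustration; the slides do not say which encoding BLU uses, and the sample column is invented.

```python
import heapq
from collections import Counter
from itertools import count


def huffman_code_lengths(values):
    """Return {value: code length in bits}; frequent values get shorter codes."""
    freq = Counter(values)
    if len(freq) == 1:
        return {next(iter(freq)): 1}      # a single distinct value needs 1 bit
    tie = count()                          # tie-breaker so dicts are never compared
    heap = [(n, next(tie), {v: 0}) for v, n in freq.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        n1, _, left = heapq.heappop(heap)
        n2, _, right = heapq.heappop(heap)
        # Merging two subtrees pushes every value in them one level deeper.
        merged = {v: depth + 1 for v, depth in {**left, **right}.items()}
        heapq.heappush(heap, (n1 + n2, next(tie), merged))
    return heap[0][2]


# Hypothetical skewed column: one value dominates.
column = ["CA"] * 90 + ["NY"] * 7 + ["TX"] * 2 + ["AK"] * 1
lengths = huffman_code_lengths(column)
total_bits = sum(lengths[v] for v in column)
print(lengths, total_bits)
```

In this example the dominant value is stored in a single bit, which matches the "as small as 1 bit" bullet.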
•  Reduce I/O
•  Increase data density
in RAM
•  Increase CPU
efficiency
[Diagram: table columns C1 through C8]
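A small sketch of why column organization reduces I/O: a query that needs one column touches only that column's values. The table, column names, and helper functions are invented for illustration.

```python
def to_columns(rows, names):
    """Pivot a list of row tuples into one list per column."""
    return {name: [row[i] for row in rows] for i, name in enumerate(names)}


def values_touched_row_store(rows):
    """A row store reads every value of every row it scans."""
    return sum(len(row) for row in rows)


def values_touched_column_store(columns, needed):
    """A column store reads only the columns the query references."""
    return sum(len(columns[name]) for name in needed)


names = ["C1", "C2", "C3", "C4", "C5", "C6", "C7", "C8"]
rows = [tuple(range(r, r + 8)) for r in range(1000)]
columns = to_columns(rows, names)

row_cost = values_touched_row_store(rows)
col_cost = values_touched_column_store(columns, ["C3"])
print(f"row store: {row_cost} values, column store: {col_cost} values")
```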
•  Cache intelligently for
analytics
•  Predictive I/O with
“Dynamic List
Prefetching”
•  Massive I/O reduction
•  Queries skip
uninteresting data
•  Synopses on every
column,
automatically.
•  “Data Skipping”
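Data skipping can be sketched with a per-block min/max synopsis: a block is scanned only if the predicate value could fall inside its range. The block size, function names, and sample data are assumptions for illustration, not details of BLU's synopsis tables.

```python
def build_synopsis(values, block_size):
    """Record (min, max, start, end) for each block of a column."""
    synopsis = []
    for start in range(0, len(values), block_size):
        block = values[start:start + block_size]
        synopsis.append((min(block), max(block), start, start + len(block)))
    return synopsis


def count_equal(values, synopsis, target):
    """Count matches, scanning only blocks whose range can contain target."""
    matches = 0
    blocks_scanned = 0
    for low, high, start, end in synopsis:
        if target < low or target > high:
            continue                      # skip: no row here can match
        blocks_scanned += 1
        matches += sum(1 for v in values[start:end] if v == target)
    return matches, blocks_scanned


# Hypothetical column: 10 years of data, 1,000 rows per year, loaded in order.
years = [year for year in range(2005, 2015) for _ in range(1000)]
synopsis = build_synopsis(years, block_size=500)
matches, blocks_scanned = count_equal(years, synopsis, 2010)
print(f"{matches} matches, scanned {blocks_scanned} of {len(synopsis)} blocks")
```

Skipping works best when the data arrives roughly ordered on the filtered column, as in this example.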
•  Use modern SIMD instructions to do
multiple operations with a single
instruction.
•  Use 1 instruction instead of 8.
•  Massive CPU acceleration
•  Store data as vectors
•  “Parallel Vector Processing”
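Python has no direct SIMD instructions, so the sketch below imitates the idea with "SIMD within a register": eight 1-byte codes are packed into one 64-bit word and compared against a target with a handful of whole-word operations instead of eight separate comparisons. The function names are made up, and this is an emulation of the principle, not the instructions BLU uses.

```python
LOW7 = 0x7F7F7F7F7F7F7F7F      # low 7 bits of each byte lane
ONES = 0x0101010101010101      # 0x01 in each byte lane
MASK64 = 0xFFFFFFFFFFFFFFFF


def pack8(codes):
    """Pack eight 1-byte codes into a single 64-bit word."""
    word = 0
    for lane, code in enumerate(codes):
        word |= (code & 0xFF) << (8 * lane)
    return word


def count_equal_bytes(word, target):
    """Count byte lanes equal to target using whole-word operations."""
    x = word ^ (target * ONES)            # matching lanes become 0x00
    # For each lane: 0xFF if the lane is non-zero, 0x7F if it is zero.
    t = ((x & LOW7) + LOW7) | x | LOW7
    zero_flags = ~t & MASK64              # 0x80 in each matching lane
    return bin(zero_flags).count("1")


word = pack8([3, 7, 3, 0, 255, 3, 1, 3])
print(count_equal_bytes(word, 3))
```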
•  RAM is too slow for BLU!
•  Redesign the query engine to
operate at CPU cache speeds
instead
•  CPU cache is 10-75X faster than
RAM access.
BLU Acceleration Illustration
10TB query in seconds or less
•  10TB data
•  Actionable Compression reduces to 1TB
•  Column Processing reduces to 10GB
•  Data Skipping reduces to 1GB
•  In-memory Parallel Processing: 32MB linear scan on each core
•  Scans as fast as 8MB through SIMD and CPU-cache optimized algorithms
•  Result in seconds or less
•  The System: 32 cores, 1TB memory, 10TB table with 100 columns and 10 years of data
•  The Query: How many “sales” did we have in 2010?
   SELECT COUNT(*) FROM MYTABLE WHERE YEAR = '2010'
•  The Result: In seconds or less as each CPU core examines the equivalent of just 8MB of data
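The chain of reductions in this illustration can be followed step by step. The factors below are inferred from the slide's own numbers (10TB to 1TB, 100 columns, 10 years, 32 cores, 32MB to 8MB) and are not stated as factors in the deck; decimal units are assumed, so the per-core figures come out slightly under the slide's rounded 32MB and 8MB.

```python
# (step name, reduction factor), in the order the slide applies them.
STEPS = [
    ("Actionable Compression", 10),               # 10TB -> 1TB
    ("Column Processing (1 of 100 columns)", 100),
    ("Data Skipping (1 of 10 years)", 10),
    ("Parallel Processing (32 cores)", 32),
    ("SIMD and cache-optimized scan", 4),         # inferred from 32MB -> 8MB
]


def walk_reductions(start_mb, steps):
    """Apply each reduction in turn and record the remaining size in MB."""
    size = start_mb
    trace = []
    for name, factor in steps:
        size = size / factor
        trace.append((name, size))
    return trace


trace = walk_reductions(10_000_000, STEPS)        # 10TB expressed in MB
for name, size_mb in trace:
    print(f"{name:40s} {size_mb:>12,.2f} MB")
```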
BLU Acceleration Illustration
10TB query in seconds or less
• 1GB RAM required to cache all data for query over 10TB of data.
• 10,000 times reduction in memory requirements to achieve in-
memory speeds
IBM dashDB – Data Warehousing as a Service
Terabytes of data ready to analyze
within minutes
•  Fast querying
•  In-memory
•  Columnar
•  SIMD hardware acceleration
•  Actionable compression
•  Advanced analytics
•  Support for OLAP SQL extensions
•  In-database analytics & R for
predictive modeling
•  Spatial analytics
•  Easy to integrate
•  Connect common 3rd party BI tools
keeps data warehouse
infrastructure out of your way
The dashDB Cocktail: Three Parts IBM, Shaken…
•  BLU Acceleration
•  Netezza In-Database Analytics
•  SoftLayer Infrastructure as a Service
Build More
Grow More
Know More
Focus on the business, not the business of data warehousing
Partnership with IBM Cloudant and dashDB
•  Cloudant is a fully managed distributed NoSQL Database as a Service (DBaaS)
•  Multi-tenant and single (dedicated) tenants
•  Managed 24x7 by Cloudant expert engineers
•  Service Level Agreement
•  Operational data store
•  dashDB’s partnership with Cloudant presents two opportunities:
1.  A launching point for new NoSQL customers looking for low-risk, low-cost avenues for getting started with data warehousing & analytics in a multi-tenant environment on the cloud
2.  A new point of entry for existing Cloudant developers to access industry-leading INZA analytics and BLU warehousing
3.  Automatic schema discovery from the JSON data. One button push to create warehouse.
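Schema discovery from JSON can be sketched as a pass over the documents that records a column type for each key. This is a simplified illustration, not the Cloudant-to-dashDB mechanism: the type names, the widening rules, and the function `infer_schema` are all assumptions.

```python
import json

# Hypothetical mapping from JSON value types to SQL column types.
SQL_TYPES = {bool: "BOOLEAN", int: "BIGINT", float: "DOUBLE", str: "VARCHAR"}


def infer_schema(json_lines):
    """Infer {column: SQL type} from one JSON document per line."""
    schema = {}
    for line in json_lines:
        for key, value in json.loads(line).items():
            if value is None:
                schema.setdefault(key, None)       # type not known yet
                continue
            # Nested objects and arrays fall back to VARCHAR in this sketch.
            sql_type = SQL_TYPES.get(type(value), "VARCHAR")
            previous = schema.get(key)
            if previous is None:
                schema[key] = sql_type
            elif previous != sql_type:
                # Widen mixed integers and floats; otherwise fall back to text.
                numeric = {previous, sql_type} == {"BIGINT", "DOUBLE"}
                schema[key] = "DOUBLE" if numeric else "VARCHAR"
    return {key: sql_type or "VARCHAR" for key, sql_type in schema.items()}


docs = [
    '{"id": 1, "price": 9.5, "name": "a"}',
    '{"id": 2, "price": 10, "active": true, "note": null}',
]
schema = infer_schema(docs)
print(schema)
```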
dashDB MPP
~elastic growth
1.  4TB building blocks (subject to change)
2.  Min 3 servers.
3.  Elastic growth without re-hashing all row data
4.  Webscale clustering
MPP Scale-out of dashDB with BLU Acceleration
[Diagram: two nodes, each with CPUs running BLU Acceleration with Dynamic In-Memory Processing]
Now in Closed Beta!
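One common way to grow a cluster without re-hashing every row is to hash rows into a fixed number of partitions and move whole partitions when a server is added. The slides do not say how dashDB does this, so treat the sketch below as an illustration of that general technique; the partition count and all names are assumptions.

```python
import zlib

NUM_PARTITIONS = 24   # fixed for the life of the cluster (hypothetical value)


def partition_of(key):
    """Row-to-partition mapping; it never changes when servers are added."""
    return zlib.crc32(key.encode("utf-8")) % NUM_PARTITIONS


def assign(num_servers):
    """Initial partition-to-server map, spread round-robin."""
    return {p: p % num_servers for p in range(NUM_PARTITIONS)}


def grow(assignment, new_server):
    """Move just enough whole partitions to the new server to even the load."""
    servers = set(assignment.values()) | {new_server}
    target = NUM_PARTITIONS // len(servers)
    load = {s: 0 for s in servers}
    for server in assignment.values():
        load[server] += 1
    new_assignment = dict(assignment)
    for p in sorted(assignment):
        if load[new_server] >= target:
            break
        donor = new_assignment[p]
        if load[donor] > target:
            new_assignment[p] = new_server
            load[donor] -= 1
            load[new_server] += 1
    return new_assignment


before = assign(3)                 # the minimum of 3 servers
after = grow(before, new_server=3)
moved = [p for p in before if before[p] != after[p]]
print(f"moved {len(moved)} of {NUM_PARTITIONS} partitions")
```

Only the moved partitions are copied to the new server; every row keeps the same partition number, so no row is re-hashed.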
The what & the wow
1.  Fast: Superior Performance 35-73X
2.  Small RAM requirements: Typically just 5% of the original uncompressed user data.
3.  Small: Superior Compression
4.  Simple: Create. Load. Go.
5.  Introducing dashDB for cloud
•  Fully managed service on IBM BlueMix
•  JSON NoSQL integration, Cloudant.com
•  R and Spatial analytics
6.  SAP BW certified
7.  Oracle compatibility @98% +
“We cut report runtimes by up
to 98% thanks to IBM DB2 with
BLU Acceleration technology –
without changing operations
processes or investing in new
hardware or software. We were
impressed how easy boosting
database performance can be.”
-Bernhard Herzog, Team Manager
Information Technology SAP, Balluff
Where to find more information:
•  dashDB.com
•  ibmbluhub.com
•  Sam Lightstone’s BLOG: SoftwareTradecraft.com
•  Sam Lightstone’s papers on DBLP:
•  http://dblp.uni-trier.de/pers/hd/l/Lightstone:Sam

More Related Content

What's hot

Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...
Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...
Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...Danielle Womboldt
 
Ceph Day Beijing - Ceph RDMA Update
Ceph Day Beijing - Ceph RDMA UpdateCeph Day Beijing - Ceph RDMA Update
Ceph Day Beijing - Ceph RDMA UpdateDanielle Womboldt
 
Ceph Day KL - Ceph on All-Flash Storage
Ceph Day KL - Ceph on All-Flash Storage Ceph Day KL - Ceph on All-Flash Storage
Ceph Day KL - Ceph on All-Flash Storage Ceph Community
 
SOUG_GV_Flashgrid_V4
SOUG_GV_Flashgrid_V4SOUG_GV_Flashgrid_V4
SOUG_GV_Flashgrid_V4UniFabric
 
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory EasyIMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory EasyIn-Memory Computing Summit
 
SOUG_SDM_OracleDB_V3
SOUG_SDM_OracleDB_V3SOUG_SDM_OracleDB_V3
SOUG_SDM_OracleDB_V3UniFabric
 
Ceph Day KL - Delivering cost-effective, high performance Ceph cluster
Ceph Day KL - Delivering cost-effective, high performance Ceph clusterCeph Day KL - Delivering cost-effective, high performance Ceph cluster
Ceph Day KL - Delivering cost-effective, high performance Ceph clusterCeph Community
 
Ceph Day San Jose - Red Hat Storage Acceleration Utlizing Flash Technology
Ceph Day San Jose - Red Hat Storage Acceleration Utlizing Flash TechnologyCeph Day San Jose - Red Hat Storage Acceleration Utlizing Flash Technology
Ceph Day San Jose - Red Hat Storage Acceleration Utlizing Flash TechnologyCeph Community
 
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...Danielle Womboldt
 
UniPlex vScaleDB pat. pending
UniPlex vScaleDB pat. pendingUniPlex vScaleDB pat. pending
UniPlex vScaleDB pat. pendingUniFabric
 
Introduction to NVMe Over Fabrics-V3R
Introduction to NVMe Over Fabrics-V3RIntroduction to NVMe Over Fabrics-V3R
Introduction to NVMe Over Fabrics-V3RSimon Huang
 
Ceph Day San Jose - Ceph at Salesforce
Ceph Day San Jose - Ceph at Salesforce Ceph Day San Jose - Ceph at Salesforce
Ceph Day San Jose - Ceph at Salesforce Ceph Community
 
Redis on NVMe SSD - Zvika Guz, Samsung
 Redis on NVMe SSD - Zvika Guz, Samsung Redis on NVMe SSD - Zvika Guz, Samsung
Redis on NVMe SSD - Zvika Guz, SamsungRedis Labs
 
Ceph Day Beijing - Storage Modernization with Intel and Ceph
Ceph Day Beijing - Storage Modernization with Intel and CephCeph Day Beijing - Storage Modernization with Intel and Ceph
Ceph Day Beijing - Storage Modernization with Intel and CephDanielle Womboldt
 
Ceph Day San Jose - Object Storage for Big Data
Ceph Day San Jose - Object Storage for Big Data Ceph Day San Jose - Object Storage for Big Data
Ceph Day San Jose - Object Storage for Big Data Ceph Community
 
Webinar: What’s Your Path to NVMe?
Webinar: What’s Your Path to NVMe?Webinar: What’s Your Path to NVMe?
Webinar: What’s Your Path to NVMe?Storage Switzerland
 
Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server
Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server
Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server Ceph Community
 
Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...
Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...
Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...Patrick McGarry
 
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDSAccelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDSCeph Community
 
Ceph Day Taipei - Accelerate Ceph via SPDK
Ceph Day Taipei - Accelerate Ceph via SPDK Ceph Day Taipei - Accelerate Ceph via SPDK
Ceph Day Taipei - Accelerate Ceph via SPDK Ceph Community
 

What's hot (20)

Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...
Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...
Ceph Day Beijing - Optimizing Ceph Performance by Leveraging Intel Optane and...
 
Ceph Day Beijing - Ceph RDMA Update
Ceph Day Beijing - Ceph RDMA UpdateCeph Day Beijing - Ceph RDMA Update
Ceph Day Beijing - Ceph RDMA Update
 
Ceph Day KL - Ceph on All-Flash Storage
Ceph Day KL - Ceph on All-Flash Storage Ceph Day KL - Ceph on All-Flash Storage
Ceph Day KL - Ceph on All-Flash Storage
 
SOUG_GV_Flashgrid_V4
SOUG_GV_Flashgrid_V4SOUG_GV_Flashgrid_V4
SOUG_GV_Flashgrid_V4
 
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory EasyIMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
IMCSummit 2016 Keynote - Benzi Galili - More Memory for In-Memory Easy
 
SOUG_SDM_OracleDB_V3
SOUG_SDM_OracleDB_V3SOUG_SDM_OracleDB_V3
SOUG_SDM_OracleDB_V3
 
Ceph Day KL - Delivering cost-effective, high performance Ceph cluster
Ceph Day KL - Delivering cost-effective, high performance Ceph clusterCeph Day KL - Delivering cost-effective, high performance Ceph cluster
Ceph Day KL - Delivering cost-effective, high performance Ceph cluster
 
Ceph Day San Jose - Red Hat Storage Acceleration Utlizing Flash Technology
Ceph Day San Jose - Red Hat Storage Acceleration Utlizing Flash TechnologyCeph Day San Jose - Red Hat Storage Acceleration Utlizing Flash Technology
Ceph Day San Jose - Red Hat Storage Acceleration Utlizing Flash Technology
 
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
Ceph Day Beijing - Our journey to high performance large scale Ceph cluster a...
 
UniPlex vScaleDB pat. pending
UniPlex vScaleDB pat. pendingUniPlex vScaleDB pat. pending
UniPlex vScaleDB pat. pending
 
Introduction to NVMe Over Fabrics-V3R
Introduction to NVMe Over Fabrics-V3RIntroduction to NVMe Over Fabrics-V3R
Introduction to NVMe Over Fabrics-V3R
 
Ceph Day San Jose - Ceph at Salesforce
Ceph Day San Jose - Ceph at Salesforce Ceph Day San Jose - Ceph at Salesforce
Ceph Day San Jose - Ceph at Salesforce
 
Redis on NVMe SSD - Zvika Guz, Samsung
 Redis on NVMe SSD - Zvika Guz, Samsung Redis on NVMe SSD - Zvika Guz, Samsung
Redis on NVMe SSD - Zvika Guz, Samsung
 
Ceph Day Beijing - Storage Modernization with Intel and Ceph
Ceph Day Beijing - Storage Modernization with Intel and CephCeph Day Beijing - Storage Modernization with Intel and Ceph
Ceph Day Beijing - Storage Modernization with Intel and Ceph
 
Ceph Day San Jose - Object Storage for Big Data
Ceph Day San Jose - Object Storage for Big Data Ceph Day San Jose - Object Storage for Big Data
Ceph Day San Jose - Object Storage for Big Data
 
Webinar: What’s Your Path to NVMe?
Webinar: What’s Your Path to NVMe?Webinar: What’s Your Path to NVMe?
Webinar: What’s Your Path to NVMe?
 
Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server
Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server
Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server
 
Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...
Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...
Using Recently Published Ceph Reference Architectures to Select Your Ceph Con...
 
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDSAccelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
Accelerating Cassandra Workloads on Ceph with All-Flash PCIE SSDS
 
Ceph Day Taipei - Accelerate Ceph via SPDK
Ceph Day Taipei - Accelerate Ceph via SPDK Ceph Day Taipei - Accelerate Ceph via SPDK
Ceph Day Taipei - Accelerate Ceph via SPDK
 

Similar to IMCSummit 2015 - Day 1 Developer Session - The Science and Engineering Behind In-Memory Analytics

Exadata architecture and internals presentation
Exadata architecture and internals presentationExadata architecture and internals presentation
Exadata architecture and internals presentationSanjoy Dasgupta
 
Building a High Performance Analytics Platform
Building a High Performance Analytics PlatformBuilding a High Performance Analytics Platform
Building a High Performance Analytics PlatformSantanu Dey
 
Oracle Database in-Memory Overivew
Oracle Database in-Memory OverivewOracle Database in-Memory Overivew
Oracle Database in-Memory OverivewMaria Colgan
 
How AI and ML are driving Memory Architecture changes
How AI and ML are driving Memory Architecture changesHow AI and ML are driving Memory Architecture changes
How AI and ML are driving Memory Architecture changesDanny Sabour
 
In-Memory Computing: Myths and Facts
In-Memory Computing: Myths and FactsIn-Memory Computing: Myths and Facts
In-Memory Computing: Myths and FactsDATAVERSITY
 
Caching Methodology & Strategies
Caching Methodology & StrategiesCaching Methodology & Strategies
Caching Methodology & StrategiesTiệp Vũ
 
Caching methodology and strategies
Caching methodology and strategiesCaching methodology and strategies
Caching methodology and strategiesTiep Vu
 
A5 oracle exadata-the game changer for online transaction processing data w...
A5   oracle exadata-the game changer for online transaction processing data w...A5   oracle exadata-the game changer for online transaction processing data w...
A5 oracle exadata-the game changer for online transaction processing data w...Dr. Wilfred Lin (Ph.D.)
 
SQream DB - Bigger Data On GPUs: Approaches, Challenges, Successes
SQream DB - Bigger Data On GPUs: Approaches, Challenges, SuccessesSQream DB - Bigger Data On GPUs: Approaches, Challenges, Successes
SQream DB - Bigger Data On GPUs: Approaches, Challenges, SuccessesArnon Shimoni
 
Handling Massive Writes
Handling Massive WritesHandling Massive Writes
Handling Massive WritesLiran Zelkha
 
High Performance Hardware for Data Analysis
High Performance Hardware for Data AnalysisHigh Performance Hardware for Data Analysis
High Performance Hardware for Data AnalysisMike Pittaro
 
Mike Pittaro - High Performance Hardware for Data Analysis
Mike Pittaro - High Performance Hardware for Data Analysis Mike Pittaro - High Performance Hardware for Data Analysis
Mike Pittaro - High Performance Hardware for Data Analysis PyData
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarKognitio
 
Hadoop Operations: Keeping the Elephant Running Smoothly
Hadoop Operations: Keeping the Elephant Running SmoothlyHadoop Operations: Keeping the Elephant Running Smoothly
Hadoop Operations: Keeping the Elephant Running SmoothlyMichael Arnold
 
MySQL Performance Tuning at COSCUP 2014
MySQL Performance Tuning at COSCUP 2014MySQL Performance Tuning at COSCUP 2014
MySQL Performance Tuning at COSCUP 2014Ryusuke Kajiyama
 
Severalnines Training: MySQL® Cluster - Part IX
Severalnines Training: MySQL® Cluster - Part IXSeveralnines Training: MySQL® Cluster - Part IX
Severalnines Training: MySQL® Cluster - Part IXSeveralnines
 
Full scan frenzy at amadeus
Full scan frenzy at amadeusFull scan frenzy at amadeus
Full scan frenzy at amadeusMongoDB
 

Similar to IMCSummit 2015 - Day 1 Developer Session - The Science and Engineering Behind In-Memory Analytics (20)

Exadata architecture and internals presentation
Exadata architecture and internals presentationExadata architecture and internals presentation
Exadata architecture and internals presentation
 
Building a High Performance Analytics Platform
Building a High Performance Analytics PlatformBuilding a High Performance Analytics Platform
Building a High Performance Analytics Platform
 
Oracle Database in-Memory Overivew
Oracle Database in-Memory OverivewOracle Database in-Memory Overivew
Oracle Database in-Memory Overivew
 
How AI and ML are driving Memory Architecture changes
How AI and ML are driving Memory Architecture changesHow AI and ML are driving Memory Architecture changes
How AI and ML are driving Memory Architecture changes
 
In-Memory Computing: Myths and Facts
In-Memory Computing: Myths and FactsIn-Memory Computing: Myths and Facts
In-Memory Computing: Myths and Facts
 
Caching Methodology & Strategies
Caching Methodology & StrategiesCaching Methodology & Strategies
Caching Methodology & Strategies
 
Caching methodology and strategies
Caching methodology and strategiesCaching methodology and strategies
Caching methodology and strategies
 
Shootout at the PAAS Corral
Shootout at the PAAS CorralShootout at the PAAS Corral
Shootout at the PAAS Corral
 
A5 oracle exadata-the game changer for online transaction processing data w...
A5   oracle exadata-the game changer for online transaction processing data w...A5   oracle exadata-the game changer for online transaction processing data w...
A5 oracle exadata-the game changer for online transaction processing data w...
 
SQream DB - Bigger Data On GPUs: Approaches, Challenges, Successes
SQream DB - Bigger Data On GPUs: Approaches, Challenges, SuccessesSQream DB - Bigger Data On GPUs: Approaches, Challenges, Successes
SQream DB - Bigger Data On GPUs: Approaches, Challenges, Successes
 
Handling Massive Writes
Handling Massive WritesHandling Massive Writes
Handling Massive Writes
 
High Performance Hardware for Data Analysis
High Performance Hardware for Data AnalysisHigh Performance Hardware for Data Analysis
High Performance Hardware for Data Analysis
 
Mike Pittaro - High Performance Hardware for Data Analysis
Mike Pittaro - High Performance Hardware for Data Analysis Mike Pittaro - High Performance Hardware for Data Analysis
Mike Pittaro - High Performance Hardware for Data Analysis
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinar
 
Hadoop Operations: Keeping the Elephant Running Smoothly
Hadoop Operations: Keeping the Elephant Running SmoothlyHadoop Operations: Keeping the Elephant Running Smoothly
Hadoop Operations: Keeping the Elephant Running Smoothly
 
Coa presentation3
Coa presentation3Coa presentation3
Coa presentation3
 
MySQL Performance Tuning at COSCUP 2014
MySQL Performance Tuning at COSCUP 2014MySQL Performance Tuning at COSCUP 2014
MySQL Performance Tuning at COSCUP 2014
 
Severalnines Training: MySQL® Cluster - Part IX
Severalnines Training: MySQL® Cluster - Part IXSeveralnines Training: MySQL® Cluster - Part IX
Severalnines Training: MySQL® Cluster - Part IX
 
Full scan frenzy at amadeus
Full scan frenzy at amadeusFull scan frenzy at amadeus
Full scan frenzy at amadeus
 
Drupal performance
Drupal performanceDrupal performance
Drupal performance
 

More from In-Memory Computing Summit

IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...
IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...
IMC Summit 2016 Breakout - Per Minoborg - Work with Multiple Hot Terabytes in...In-Memory Computing Summit
 
IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...
IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...
IMC Summit 2016 Breakout - Henning Andersen - Using Lock-free and Wait-free I...In-Memory Computing Summit
 
IMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing Hub
IMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing HubIMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing Hub
IMC Summit 2016 Breakout - Roman Shtykh - Apache Ignite as a Data Processing HubIn-Memory Computing Summit
 
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...
IMC Summit 2016 Breakout - Nikita Shamgunov - Propelling IoT Innovation with ...In-Memory Computing Summit
 
IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...
IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...
IMC Summit 2016 Breakout - Matt Coventon - Test Driving Streaming and CEP on ...In-Memory Computing Summit
 
IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...
IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...
IMC Summit 2016 Innovation - Derek Nelson - PipelineDB: The Streaming-SQL Dat...In-Memory Computing Summit
 
IMC Summit 2016 Innovation - Dennis Duckworth - Lambda-B-Gone: The In-memory ...
IMC Summit 2016 Innovation - Dennis Duckworth - Lambda-B-Gone: The In-memory ...IMC Summit 2016 Innovation - Dennis Duckworth - Lambda-B-Gone: The In-memory ...
IMC Summit 2016 Innovation - Dennis Duckworth - Lambda-B-Gone: The In-memory ...In-Memory Computing Summit
 
IMC Summit 2016 Innovation - Steve Wilkes - Tap Into Your Enterprise – Why Da...
IMC Summit 2016 Innovation - Steve Wilkes - Tap Into Your Enterprise – Why Da...IMC Summit 2016 Innovation - Steve Wilkes - Tap Into Your Enterprise – Why Da...
IMC Summit 2016 Innovation - Steve Wilkes - Tap Into Your Enterprise – Why Da...In-Memory Computing Summit
 
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
IMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X PlatformIMC Summit 2016 Innovation - Girish Mutreja - Unveiling the X Platform
IMCSummit 2015 - Day 1 Developer Session - The Science and Engineering Behind In-Memory Analytics

  • 1. Sam Lightstone, Distinguished Engineer. © 2015 IBM Corporation. The Science and Engineering Behind Super Fast Load-and-Go In-memory Analytics. IBM Analytics Platform Group
  • 2. BLU Acceleration 1. Next generation in-memory SQL data warehousing • Super Fast (query performance) • Super Simple (load-and-go) • Super Small (RAM and storage savings) 2. Seamlessly integrated • Built seamlessly into both dashDB (cloud) and DB2 (software) • Consistent SQL, language interfaces, administration • Dramatic simplification 3. Hardware optimized • Memory optimized • CPU-optimized • I/O optimized
  • 3. 35X-73X faster …than traditional row-organized relational database technology, on average.
  • 4. “Compared to our current production system, DB2 10.5 with BLU Acceleration is running 106x faster for our Admissions and Enrollment workloads. We had one query that we would often cancel if it didn’t finish in 30 minutes. Now it runs in 56 seconds every time. 32x faster, predictable response time, no tuning…what more could we ask for?” - Brenda Boshoff, Sr. DBA “Wow…unbelievable speedup in query run times! We saw a speedup of 273x in our Vehicle Tracking report, taking a query from 10 minutes to 2.2 seconds. That adds value to our business; our end users are going to be ecstatic!” - Ruel Gonzalez - Information Services DataProxy LLC
  • 5. “My largest row-organized, adaptive compressed table gave me 3.2x storage savings. However, converting this row-organized uncompressed table to a column-organized table in DB2 10.5 delivered a massive 15.4x savings!” - Iqbal Goralwalla, Head of DB2 Managed Services, Triton. Records: 76M. Columns: 61. Indexes: 10. Load Time: Row-unc 15:39:10; Col 1:10:29 (97%). 13.5x faster load time. Storage Savings: 31.5x storage savings (97% less storage required).
  • 7. Risk system injects 1/2 TB per night from 25 different source systems. “Impressive Load times.” Some queries achieved an almost 100x speed up with literally no tuning. 6 hours from installing BLU to query results. One of the world’s most profitable and secure rated banks.
  • 8. Load-and-go simplicity • No indexes • No storage reclaim (it’s automated) • No memory configuration (it’s automated) • No process model configuration (it’s automated) • No statistics collection (it’s automated) • No MDC or MQTs • No statistical views • No optimizer profiles/guidelines “The BLU Acceleration technology has some obvious benefits: … But it’s when I think about all the things I don't have to do with BLU, it made me appreciate the technology even more: no tuning, no partitioning, no indexes, no aggregates.” -Andrew Juarez, Lead SAP Basis and DBA. Simple. • LOAD and then… run queries
  • 9. In-memory speeds with tiny RAM requirements • Rule of thumb: RAM requirements are just 5% of source data size. • Example: 10 Terabytes of raw user data, 500 GB of RAM
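
The rule of thumb on this slide is simple proportional arithmetic. A minimal sketch, assuming decimal units (1 TB = 1000 GB); the function name and the `ram_percent` parameter are illustrative and not part of any IBM tool:

```python
def estimated_ram_gb(raw_data_tb, ram_percent=5.0):
    """Estimate RAM in GB for a given raw data size in TB.

    Assumes decimal units (1 TB = 1000 GB) and the slide's 5% rule of thumb.
    """
    raw_data_gb = raw_data_tb * 1000
    return raw_data_gb * ram_percent / 100


# The slide's example: 10 TB of raw user data.
print(estimated_ram_gb(10))  # 500.0
```

Actual requirements depend on the workload and on how well the data compresses, so the result is a starting estimate only.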
  • 10. Magic: How did they do that?
  • 11. BLU Acceleration. Easiest. Fastest. Smallest. IBM Means Innovation. BLU Acceleration includes over 30 new patents and patents pending from IBM Research & Development Laboratories.
  • 12. Looking at Disk Improvements (courtesy of D. DeWitt, PASS Summit Keynote 2009) • Incredibly inexpensive drives (& processors) have made it possible to collect, store, and analyze huge quantities of data. Over the last 30 years: Capacity: 80 MB → 800 GB (10,000x); Transfer rates: 1.2 MB/sec → 80 MB/sec (65x). But consider the metric transfer bandwidth/byte: • 1980: 1.2 MB/sec / 80 MB = 0.015 • 2009: 80 MB/sec / 800,000 MB = 0.0001. When relative capacities are factored in, drives are 150X slower today!!!
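
The bandwidth-per-byte comparison on this slide can be reproduced directly. The sketch below uses only the figures quoted on the slide, in decimal units; the variable names are illustrative, and the full-scan times are an extra derived quantity that is not on the slide:

```python
# Figures quoted on the slide (decimal units: 800 GB = 800,000 MB).
capacity_1980_mb = 80
capacity_2009_mb = 800_000
rate_1980_mb_s = 1.2
rate_2009_mb_s = 80

# Growth factors over the period.
capacity_growth = capacity_2009_mb / capacity_1980_mb  # 10,000x
rate_growth = rate_2009_mb_s / rate_1980_mb_s          # about 66.7x (slide says 65x)

# Transfer bandwidth per MB of capacity.
bw_per_mb_1980 = rate_1980_mb_s / capacity_1980_mb     # 0.015
bw_per_mb_2009 = rate_2009_mb_s / capacity_2009_mb     # 0.0001
relative_slowdown = bw_per_mb_1980 / bw_per_mb_2009    # about 150x

# Derived view: time to read the whole drive once, in seconds.
full_scan_1980_s = capacity_1980_mb / rate_1980_mb_s   # about 67 s
full_scan_2009_s = capacity_2009_mb / rate_2009_mb_s   # 10,000 s

print(round(relative_slowdown), round(full_scan_1980_s), round(full_scan_2009_s))
```

Read this way, the 150x figure says that scanning an entire drive takes roughly 150 times longer than it did in 1980, even though the drive itself transfers data faster.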
  • 13. We’ve been snookered! Main Memory. Source: http://www.jcmit.com/mem2013.htm
  • 14. Big Idea: RAM is Too Slow. CPU cache optimized • RAM is at the bottom of the memory hierarchy. It is the slowest non-persistent memory in a server. • CPU cache is many times faster than RAM. • Extreme re-engineering of database algorithms to be CPU cache optimized • BLU algorithms adapt automatically to hardware cache size. [Chart: Speed-up versus RAM. RAM 1x, L3 15x, L2 41x, L1 173x]
  • 15. More Evil Than Ever 1. Human intervention 2. Voluminous I/O 3. Random I/O 4. Memory stalls 5. Single core processing 6. The cost of RAM
  • 16. Load-and-go simplicity: System resources 1. Auto-detect and adapt to available RAM 2. Auto-detect and adapt to core number and type 3. Auto-detect and adapt to CPU cache size
  • 17. Load-and-go simplicity: Automatic Workload Management • Built-in and automated query resource consumption control • Many queries can be submitted, but effective concurrency, per query RAM, and CPU are automatically and dynamically controlled. [Diagram: Applications and Users submit up to tens of thousands of SQL queries at once; in the BLU Runtime, a moderate number of queries consume resources.]
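
The idea of accepting many submissions while letting only a moderate number run at once can be sketched with a counting semaphore. This is an illustration of admission control in general, not BLU's workload manager: the slide describes limits that are adjusted automatically and dynamically, whereas this sketch assumes a fixed limit, and the `AdmissionController` class and `demo` function are hypothetical names.

```python
import threading
import time


class AdmissionController:
    """Lets any number of callers submit work, but runs at most max_concurrent at once."""

    def __init__(self, max_concurrent):
        self._slots = threading.Semaphore(max_concurrent)
        self._lock = threading.Lock()
        self.running = 0
        self.peak = 0  # highest number of queries observed running together

    def run(self, query_fn):
        with self._slots:  # blocks while all slots are taken
            with self._lock:
                self.running += 1
                self.peak = max(self.peak, self.running)
            try:
                return query_fn()
            finally:
                with self._lock:
                    self.running -= 1


def demo(num_queries=20, max_concurrent=3):
    controller = AdmissionController(max_concurrent)
    results = []
    results_lock = threading.Lock()

    def submit(i):
        def query():
            time.sleep(0.01)  # stand-in for real query work
            return i * i

        value = controller.run(query)
        with results_lock:
            results.append(value)

    threads = [threading.Thread(target=submit, args=(i,)) for i in range(num_queries)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return sorted(results), controller.peak


if __name__ == "__main__":
    values, peak = demo()
    print(len(values), "queries finished; peak concurrency:", peak)
```

All twenty submissions complete, but the recorded peak never exceeds the configured limit, which is the behaviour the slide's diagram depicts.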
  • 18. Load-and-go simplicity: Automatic Space Reclaim • Automatic space reclamation • Frees extents with no active values • The storage can be subsequently reused by any table in the table space • No need for costly DBA space management and REORG utility • Space is freed online while work continues • Regular space management can result in increased performance of RUNSTATS and some queries. [Diagram: Columns 1 to 3 stored in storage extents holding 2012 and 2013 values. After DELETE FROM MyTable WHERE Year = 2012, the extents that held 2012 values hold only deleted data.]
  • 19. HONEY, I SHRUNK THE DATA (BIG → tiny) • 10-20X smaller is common • Compress as small as 1 bit • Compress the most frequent data the smallest. “Actionable Compression” • No CPU for decompression • Data flows through memory and CPU at compressed size
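
Two of the ideas on this slide, giving the most frequent values the smallest codes and evaluating a query on the codes without decompressing, can be illustrated with a frequency-ranked dictionary. This is a simplified sketch and not IBM's actual encoding; all function names are hypothetical, and `bits_needed` only counts bits rather than defining a decodable bit layout.

```python
from collections import Counter


def build_dictionary(values):
    """Assign integer codes by descending frequency: the most frequent value gets code 0."""
    counts = Counter(values)
    # Ties are broken by value so the result is deterministic.
    ordered = sorted(counts, key=lambda v: (-counts[v], v))
    return {value: code for code, value in enumerate(ordered)}


def encode(values, dictionary):
    """Replace every value with its dictionary code."""
    return [dictionary[v] for v in values]


def count_equal(encoded, dictionary, literal):
    """Count rows equal to literal by comparing codes; the column is never decoded."""
    code = dictionary.get(literal)
    if code is None:
        return 0  # the literal does not occur in this column
    return sum(1 for c in encoded if c == code)


def bits_needed(code):
    """Bits required to write a code in binary (at least 1)."""
    return max(1, code.bit_length())


states = ["CA", "CA", "NY", "CA", "TX", "NY", "CA", "CA"]
dictionary = build_dictionary(states)
encoded = encode(states, dictionary)

print(dictionary)                              # {'CA': 0, 'NY': 1, 'TX': 2}
print(count_equal(encoded, dictionary, "CA"))  # 5
print(sum(bits_needed(c) for c in encoded))    # 9
```

The literal in the predicate is translated to a code once, and the scan then compares small integers, which is the sense in which compressed data stays "actionable".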
  • 20. • Reduce I/O • Increase data density in RAM • Increase CPU efficiency [Diagram: columns C1 to C8] • Cache intelligently for analytics • Predictive I/O with “Dynamic List Prefetching” • Massive I/O reduction [Diagram: RAM and disks] • “Data Skipping” • Queries skip uninteresting data • Synopses on every column, automatically.
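
Data skipping with per-column synopses can be sketched as a small min/max summary kept for each block of a column. The block size, the tuple layout, and the function names below are assumptions made for illustration; the slides do not describe the real synopsis format.

```python
def build_synopsis(column, block_size):
    """Record (start, end, min, max) for each fixed-size block of a column."""
    synopsis = []
    for start in range(0, len(column), block_size):
        block = column[start:start + block_size]
        synopsis.append((start, start + len(block), min(block), max(block)))
    return synopsis


def count_equal_with_skipping(column, synopsis, literal):
    """Count rows equal to literal, scanning only blocks whose range can contain it."""
    matches = 0
    blocks_scanned = 0
    for start, end, lo, hi in synopsis:
        if literal < lo or literal > hi:
            continue  # the synopsis proves no row in this block can match
        blocks_scanned += 1
        matches += sum(1 for v in column[start:end] if v == literal)
    return matches, blocks_scanned


# Four years of data, four rows per year, stored in load order.
years = [2008] * 4 + [2009] * 4 + [2010] * 4 + [2011] * 4
synopsis = build_synopsis(years, block_size=4)

print(count_equal_with_skipping(years, synopsis, 2010))  # (4, 1)
```

Only one of the four blocks is read here. Skipping works best when related values are stored close together, as with data loaded in date order.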
  • 21. •  Use modern SIMD instructions to do multiple operations with a single instruction. •  Use 1 instruction instead of 8. •  Massive CPU acceleration •  Store data as vectors •  “Parallel Vector Processing” •  RAM is too slow for BLU ! •  Redesign the query engine to operate at CPU cache speeds instead •  CPU cache is 10-75X faster than RAM access.
  • 22. BLU Acceleration Illustration: 10TB query in seconds or less • The System: 32 cores, 1TB memory, 10TB table with 100 columns and 10 years of data • The Query: How many “sales” did we have in 2010? SELECT COUNT(*) FROM MYTABLE WHERE YEAR = '2010' • The Result: In seconds or less, as each CPU core examines the equivalent of just 8MB of data. [Diagram: 10TB data → Actionable Compression reduces to 1TB (in-memory) → Column Processing reduces to 10GB → Data Skipping reduces to 1GB → Parallel Processing: 32MB linear scan on each core → scans as fast as 8MB through SIMD and CPU-cache optimized algorithms → result in seconds or less]
  • 23. BLU Acceleration Illustration: 10TB query in seconds or less • 1GB RAM required to cache all data for query over 10TB of data. • 10,000 times reduction in memory requirements to achieve in-memory speeds. [Diagram: same reduction pipeline as slide 22, from 10TB data down to the equivalent of an 8MB scan per core]
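
The reduction steps in this illustration can be checked with a few divisions. The sketch assumes decimal units, a 10x compression ratio (10TB to 1TB, as on the slide), equally sized columns, data spread evenly across the 10 years, and a 4x factor for the SIMD step, which is inferred from the slide's 32MB and 8MB figures rather than stated on it.

```python
MB_PER_TB = 1_000_000  # decimal units

raw_mb = 10 * MB_PER_TB               # 10 TB table
compressed_mb = raw_mb / 10           # actionable compression: 1 TB
one_column_mb = compressed_mb / 100   # the query touches 1 of 100 columns: 10 GB
one_year_mb = one_column_mb / 10      # data skipping keeps 1 of 10 years: 1 GB
per_core_mb = one_year_mb / 32        # split across 32 cores: 31.25 MB
simd_equiv_mb = per_core_mb / 4       # assumed 4x SIMD factor: about 7.8 MB

memory_reduction = raw_mb / one_year_mb  # 10,000x less data to keep in RAM

print(per_core_mb, simd_equiv_mb, memory_reduction)  # 31.25 7.8125 10000.0
```

The slide rounds the last two stages to 32MB and 8MB; the 10,000x memory reduction on this slide is the ratio of the 10TB table to the 1GB that the query actually needs.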
  • 24. IBM dashDB – Data Warehousing as a Service. Terabytes of data ready to analyze within minutes • Fast querying • In-memory • Columnar • SIMD hardware acceleration • Actionable compression • Advanced analytics • Support for OLAP SQL extensions • In-database analytics & R for predictive modeling • Spatial analytics • Easy to integrate • Connect common 3rd party BI tools. Keeps data warehouse infrastructure out of your way.
  • 25.
  • 26. The dashDB Cocktail: Three Parts IBM, Shaken… • BLU Acceleration • Netezza In-Database Analytics • SoftLayer Infrastructure as a Service. Build More. Grow More. Know More. Focus on the business, not the business of data warehousing.
  • 27. Partnership with IBM Cloudant and dashDB • Cloudant is a fully managed distributed NoSQL Database as a Service (DBaaS) • Multi-tenant and single (dedicated) tenants • Managed 24x7 by Cloudant expert engineers • Service Level Agreement • Operational data store • dashDB’s partnership with Cloudant presents two opportunities: 1. A launching point for new NoSQL customers looking for low-risk, low-cost avenues for getting started with data warehousing & analytics in a multi-tenant environment on the cloud 2. A new point of entry for existing Cloudant developers to access industry-leading INZA analytics and BLU warehousing 3. Automatic schema discovery from the JSON data. One button push to create warehouse.
  • 28. dashDB MPP ~ elastic growth 1. 4TB building blocks (subject to change) 2. Min 3 servers. 3. Elastic growth without re-hashing all row data 4. Webscale clustering. MPP Scale-out of dashDB with BLU Acceleration. [Diagram: multiple servers, each with CPUs and BLU Acceleration Dynamic In-Memory Processing] Now in Closed Beta!
  • 29. The what & the wow 1. Fast: Superior Performance 35-73X 2. Small RAM requirements: Typically just 5% of the original uncompressed user data. 3. Small: Superior Compression 4. Simple: Create. Load. Go. 5. Introducing dashDB for cloud • Fully managed service on IBM BlueMix • JSON NoSQL integration, Cloudant.com • R and Spatial analytics 6. SAP BW certified 7. Oracle compatibility @98% + “We cut report runtimes by up to 98% thanks to IBM DB2 with BLU Acceleration technology – without changing operations processes or investing in new hardware or software. We were impressed how easy boosting database performance can be.” -Bernhard Herzog, Team Manager Information Technology SAP, Balluff
  • 30. Where to find more information: • dashDB.com • ibmbluhub.com • Sam Lightstone’s BLOG: SoftwareTradecraft.com • Sam Lightstone’s papers on DBLP: http://dblp.uni-trier.de/pers/hd/l/Lightstone:Sam