Strata + Hadoop World San Jose 2015: Video Compilation
Video description
Go right to the heart of big data
Find out what happens when cutting-edge data science and new business fundamentals intersect. With this complete video compilation, you’ll be on hand for every presentation—whether it’s a keynote, a tutorial, or a workshop—held at the Strata Conference + Hadoop World Conference in San Jose, California during February, 2015.
In ten tracks, this year’s conference captured the most …
Strata + Hadoop World San Jose 2015: Video Compilation
Video description
Go right to the heart of big data
Find out what happens when cutting-edge data science and new business fundamentals intersect. With this complete video compilation, you’ll be on hand for every presentation—whether it’s a keynote, a tutorial, or a workshop—held at the Strata Conference + Hadoop World Conference in San Jose, California during February, 2015.
In ten tracks, this year’s conference captured the most challenging problems and compelling opportunities in data today, including:
Business & Industry: How organizations of all sizes use data to make better decisions
Connected World: Navigating in an always-connected, always-on world
Data Science: Everything from the latest algorithms and advances in machine learning to cultural change and team-building
Design & Interfaces: Capturing user experience, design, new interfaces, and visualization
Hadoop & Beyond: How tools like Cassandra, Storm, Accumulo, Kafka and Spark fit in the data science toolkit
The Hadoop Platform: A deep dive into the dominant big data stack, with practical lessons and integration tricks
Hadoop in Action: Real-world case studies of the Hadoop ecosystem in action
Law, Ethics & Open Data: Issues on governance, ethics, and compliance in the era of open data
Machine Data: Extracting meaningful insights from data collected and generated by things
Security: Fighting fraud, detecting threats, increasing trust—and securing data
You also have complete access to other conference events, such as Data-Driven Business Day, Hardcore Data Science Day, and Spark Camp.
Download these videos or stream them through our HD player, and gain a clear perspective on data, including all the analytics, architectures, techniques, tools, and technologies you need to use it successfully.
Hiding the Elephant - How Big Data Apps Make Magic While Hiding Hadoop - Ross Fubini, Ari Gesher, Wei Zheng, Omer Trajman, and Sylvain Le Borgne
Pumping Up Retail Profits with Predictive Analytics - Adam Jorgensen
If You Don’t Have Anything Nice to Say, Please Say Something: Increasing Honesty in Airbnb Reviews - Dave Holtz
Making Big Data Usable in Market Regulation - Scott Donaldson
WANTED: Women in Data, Tech, and STEM - Moderated by: Cornelia Lévy-Bencheton, Panelists: Michele Chambers, Alice Zheng and Neha Narkhede
Helping the Republican Party Use Data and Engineering to Win the US Senate - Azarias Reda
Using Big Data to Identify the World’s Top Experts - Nima Sarshar
The New Data Organization: What do Successful Data-Driven Companies Look Like? - John Haddad
Architecting for the Cloud - Chris Neumann
Solving Customer Problems with Big Data across Thomson Reuters - Brian Ulicny
Connected World
Improving Business Operations with Predictive Maintenance and Service - Oliver Mainka
Forget the Valley: Middle America Is Where Data Is Having Its Biggest Impact - Matt Asay
Robot Reporters: How The Associated Press Embraced Data Automation - Adam Smith
Which is More Interesting - Millions of Thermostats, or Millions of Minds in the Internet of Things? - Doug Stein
Economic Insights from LinkedIn’s Professional Network - June Andrews
Using Data to Help Farmers Feed Growing Populations in a Changing Climate - Stewart Collis
Data Science
Bots Don’t Drink Soda: Using Big Data to Find Real People - Michael Brown
How to Detect Anomalies in High Cardinality Dimensions and Make Them Actionable - Shankar Vedaraman and Christopher Colburn
Big Data and Design Working Together – When the Magic Happens - George Roumeliotis
HOWTO Make Your Future Data Scientists Love You - Sasha Laundy
From Academia to Data Science: Lessons Learned Founding the Insight Data Science Fellows Program - Jake Klamka and Kathy Copic
The Two Cultures of People Science - Michelangelo D’Agostino
Pro Bono Data Science in Action - Helping Teens in Crisis - Noelle Sio
Data Applications: Speed vs Accuracy - Danielle Ben-Gera
Behavior-driven Machine Translation - Irina Borisova and Asim Mathur
Playing Nice in the Product Playground: Data Scientists, Engineers, and Product Managers Working Together to Create Innovative Data Products - Anu Tewary, Lucian Lita and Jonathan Goldman
Machine Learning Building Blocks and the Workload Optimization Framework - Shai Fine
Robust Event Detection Using Diverse Data Types - Harrison Mebane
Purposeful Education with Job Market Data for Students, Educators, and Institutions - Jike Chong
Real-Time Relevance for Mobile at LinkedIn - Michael Conover
Design Interfaces
Building Interactive Data Visualizations - Jonathan Dinu - Part 1
Building Interactive Data Visualizations - Jonathan Dinu - Part 2
Building Interactive Data Visualizations - Jonathan Dinu - Part 3
Building Interactive Data Visualizations - Jonathan Dinu - Part 4
The Human-Data Interface: How to Design for “Irrational” Data Consumers - Cathy Tanimura
Designing Delightful Data Products - Alonzo Canada
Designing for Data - Etan Lightstone
Humanizing Data - Building Systems and Interfaces for Domain Experts - Ari Gesher and James Thompson
Architecting Interfaces that Learn - Tye Rattenbury and Jeffrey Heer
What Designers and Data Scientists Can Learn from Each Other - Danyel Fisher and Miriah Meyer
Data (Art ) Science - Eric Colson
Designing with Data: A Human-centered Approach to Data-driven Design - Arianna McClain and Coe Leta Stafford
Hadoop Beyond
Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 1
Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 2
Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 3
Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Reza Zadeh - Part 4
Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Reza Zadeh - Part 5
Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan and Krishna Sankar - Part 6
Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan and Christopher Fregly - Part 7
Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 8
Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 1
Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 2
Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 3
Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 4
Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 1
Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 2
Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 3
Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 4
Going Real-time: Data Collection and Stream Processing with Apache Kafka - Jay Kreps
Stream Processing Everywhere - What to Use? - Jim Scott
Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Engine - Richard Williamson
From MapReduce to Programming Frameworks: Making Sense of Cloud Dataflow, Spark and New Tools for Big Data - Eric Schmidt
Drill into Drill: How Providing Flexibility and Performance is Possible - Jacques Nadeau
Three Approaches to Scalable Data Curation - Michael Stonebraker
One Billion Objects in 2GB: Big Data Analytics on Small Clusters with Doradus OLAP - Randy Guck
Big Data at Netflix: Faster and Easier - Kurt Brown
Search Evolved: Unraveling Your Data - Costin Leau
The Year in Review - Key Changes in the Hadoop Platform in the Past 12 Months - Jairam Ranganathan
Building Interactive Data Applications at Scale - Fangjin Yang and Vadim Ogievetsky
YARN vs. MESOS: Can’t We All Just Get Along? - Ted Dunning
Hadoop Platform
Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 1
Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 2
Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 3
Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 4
Building an Apache Hadoop Data Application - Tom White, Joey Echeverria, and Ryan Blue - Part 1
Building an Apache Hadoop Data Application - Tom White, Joey Echeverria, and Ryan Blue - Part 2
Building an Apache Hadoop Data Application - Tom White, Joey Echeverria, and Ryan Blue - Part 3
Building A Data Platform - Manu Mukerji, Stephen O’Sullivan, and John Akred - Part 1
Building A Data Platform - Manu Mukerji, Stephen O’Sullivan, and John Akred - Part 2
Building A Data Platform - Manu Mukerji, Stephen O’Sullivan, and John Akred - Part 3
Building A Data Platform - Manu Mukerji, Stephen O’Sullivan, and John Akred - Part 4
Hadoop Puzzlers Reloaded - Aaron Myers and Daniel Templeton
The Future of Apache Hadoop Security - Joey Echeverria
Making HBase Accessible to Scientists - Spencer Herath and Aaron Benz
Data Discovery on Hadoop - Sumeet Singh and Thiruvel Thirumoolan
Yarns about YARN: Migrating to MapReduce v2 - Kathleen Ting and Miklos Christine
Maintaining Low Latency while Maximizing Throughput on a Single Cluster - Yuliya Feldman
Running Production Hadoop Clusters in Docker Containers - Nasser Manesh
How to use Parquet as a Basis for ETL and Analytics - Julien Le Dem
Adding Insert, Update, and Delete to Hive - Alan Gates
Top Ten Pitfalls to Avoid in a SQL-on-Hadoop Implementation - Monte Zweben
Hadoop in Action
The Evolution of Hadoop at Spotify - Through Failures and Pain - Josh Baer and Rafal Wojdyla
From Source to Solution: Building A System for Machine and Event-Oriented Data - Eric Sammer
Design Patterns for Real Time Streaming Data Analytics - Sheetal Dolas
Stock Market Order Flow Reconstruction in HBase on AWS - Tigran Khrimian
Ticketmaster: Marketing and Selling the World’s Tickets - John Carnahan
Designing Data Architectures for Robust Decision Making - Gwen Shapira
Friction-Free ETL: Automating Data Transformation with Impala - Marcel Kornacker
The Truth About MapReduce Performance on SSDs - Yanpei Chen and Karthik Kambatla
Hadoop as a Platform for Genomics - Allen Day and Sungwook Yoon
Law, Ethics Open Data
Data Scientists and Lawyers - a Marriage made in Silicon Valley - Laura Fennell and Bill Loconzolo
Big Data Ethics and a Future for Privacy - Jonathan King
How Minority Becomes Majority - A Study of Gerrymandering - Tatsiana Maskalevich
Machine Data / IoT
Transformational Case Studies in Machine Data Telemetry - Chad Meley and John Kreisa
TSAR (the TimeSeries AggregatoR) - How to Count Tens of Billions of Daily Events in Real Time Using Open Source Technologies - Anirudh Todi
An Open Source Approach to Gathering and Analyzing Device Sourced Health Data - Ian Eslick
Building Adaptive Apps with APIs and Data - Anant Jhingran
Dynamic Events in Massive Data Streams, from Astrophysics to Marketing Automation - Kirk Borne
Forecasting Space-time Events - Jeremy Heffner
The IoT P2P Backbone - Bruno Fernandez-Ruiz
The Sushi Principle: Raw Data Is Better - Joseph Adler and Robert Johnson
Practical Methods for Identifying Anomalies That Matter in Large Datasets - Robert Grossman
Streaming Analytics: It’s Not The Same Game - Subutai Ahmad
Machine Learning For Oil Exploration - Ben Hamner
Security
Data Science vs. The Bad Guys: Using Data to Defend LinkedIn Against Fraud and Abuse - David Freeman
How to Ensure Your Hadoop Installation is Not the Next Big Data Breach - Terence Spies
Securing the New Wearable World - Gary Davis
The Physics of Apache Hadoop: Choosing the Right Hardware and OS Configuration Mix for Your Workloads - Woody Christy, Steve Anderson, Patrick Schots and Floris Grandvarlet
Enterprise Adoption
Database History from Codd to Brewer and Beyond - Douglas Turnbull
Ideal Platform for Managing Log Data: Search or SQL? - Vinayak Borkar
Getting Started with Data Governance: Paths Converge from Multiple Starting Points - Paula Wiles Sigmon
Don’t Let Today’s Demands Kill Tomorrow’s Workforce! - Martin Waterhouse
Spark in Action
Lessons from Running Large Scale Spark Workloads - Reynold Xin and Matei Zaharia
Introducing Hive’s New Execution Engine - Spark - Xuefu Zhang and Chengxiang Li
Machine Learning with H2O and Spark - Cliff Click and Michal Malohlava
Spark Streaming - The State of the Union, and Beyond - Tathagata Das
Why Spark Is the Next Top (Compute) Model - Dean Wampler
Tuning and Debugging in Apache Spark - Patrick Wendell
Everyday I’m Shuffling - Tips for Writing Better Spark Programs - Vida Ha and Holden Karau
Hardcore Data Science
Beyond DNNs towards New Architectures for Deep Learning, with Applications to Large Vocabulary Continuous Speech Recognition - Tara Sainath
On the Computational and Statistical Interface and “Big Data” - Michael Jordan
Interpretable Machine Learning in Practice - Maya Gupta
Gaining Value From Data Where It’s Born - Ryan Peterson
Build a Foundation for Self-Service Data Prep, Analytics, and Governance - Oliver Claude
Connecting the Big-Data Driven Enterprise in Online Retail - Ashley Stirrup
Leading Telecommunications Company Uses BlueData to Spin Up Local, On Demand Hadoop and Spark Clusters to Enable Agile Deployment of Big Data Tools and Technologies - Nanda Vijaydev
Taming Data Variety: Intelligent Solutions Using Machine Learning and Expert Crowdsourcing - Alan Wagner
Everything You Need To Know About HBase in 10 Minutes or Less - Alex Newman
The Emergence of the Data Refinery - Chuck Yarbrough
Big Data Cluster Planning and Optimization Using Wolf Island Simulation Technology - Laurent Isenegger
Prosthetic Implant Surgery - Where Big Data Means Big Savings - Rola Shaar
Close the Skills Gap and Deliver Rapid Business Value with Big Data Apps - Manan Goel
Distributed R - Scaling the R Language for Even Bigger Data - Sunil Venkayala
Transforming Big Data Landscape with Apache Spark - Rishi Yadav
Data Warehousing in the Cloud - Jon Bock
Proactive Product Intelligence for Electronics - Rami Lokas
Massive-Scale Security Incident Response Leveraging a Hadoop Architecture - Michael A. Davis
Don’t be a Hadoop Breach Headline - Discovery and Sensitive Data in Hadoop - Jeremy Stieglitz
Big Data vs. Climate Change - Srivatsan Ramanujam and John Cardente
ZEAS – Enabling anyone to create Hadoop Enterprise applications fast using a GUI - Aditya Agrawal
Power Tools for Big Data Analytics - Dan Steinberg
Big Data on OpenStack - Kirk Lewis and Frank Rego
Fighting ATM Fraud in Real Time with Hadoop Analytics - Christy Maver
Scale Big Data cost down, while scaling performance out. An NTT mobile personalization retrospective, re-thinking the Big Data solution stack. - Robert Greene
Dato Enables Large-Scale Deduplication at Zillow using GraphLab Create - Rajat Arya
To Catch a Thief with Big Data - Kevin Petrie
Jump into the Data Lake with Hadoop-Scale Data Integration - Greg Benson
Predicting The Future To Improve Customer Satisfaction - Joe Rossi
The Practical, Profitable Magic of Prescriptive Analytics - Andy Flint
Changing the Culture Around Data: Empowering More People with Analytics - Gary Cottrell
How Havas Media Found New Revenue Streams with UNIFi Software - Sean Keenan
What Enterprises Can Learn From Real-Time Bidding - Peter Corless
Big Data and the Data Quality Imperative - Ed Wrazen
Tapjoy Scales and Saves Costs with Riak - Tom Sigler
Smart Execution: How to Optimize Performance by Intelligently Leveraging Multiple Hadoop Analytics Engines - Matt Schumpert
Jagex Game Studio Case Study - Gregory McPhee
Supercharge Sqoop with magical JDBC drivers - Sumit Sarkar
Big Data Analytics: Diverse Use Cases, Diverse Architectures - Ben Conners
Accelerate your data with SequoiaDB - Tao Wang
Building reliable Hadoop clusters with two copies - Iyer Venkatesan
Start your Free Trial Self paced Go to the Course We have partnered with providers to bring you collection of courses, When you buy through links on our site, we may earn an affiliate commission from provider.
This site uses cookies. By continuing to use this website, you agree to their use.I Accept