Strata Conference New York + Hadoop World 2014: Video Compilation
Video description
Use the power of big data to drive business strategy
What happens when cutting-edge data science and new business fundamentals intersect? Find out with this complete video compilation of Strata + Hadoop World 2014 in New York, where you’ll get a front-row seat to every keynote, workshop, and session.
Ten conference tracks were required to capture the most challenging problems and compelling opportunities in data today, with presentations from Mike Olson (Cloudera), Kim Rees (Periscopic), Roger Magoulas (O'Reilly), Douglas Merrill (ZestFinance), Amanda Cox (The New York Times), and scores of other experienced data practitioners from finance, media, government, and education.
Download these videos or stream them through our HD player, and gain a clear perspective on the future of big data, including all the analytics, architectures, techniques, tools, and technologies you need to use data successfully.
Tracks include:
Business & Industry: How organizations of all sizes use data to make better decisions
Connected World: Navigating in an always-connected, always-on world
Data Science: Everything from the latest algorithms and advances in machine learning to cultural change and team-building
Design & Interfaces: Capturing user experience, design, new interfaces, and visualization
Law, Ethics & Open Data: Issues on governance, ethics, and compliance in the era of open data
Machine Data: Extracting meaningful insights from data collected and generated by things
Security: Fighting fraud, detecting threats, increasing trust—and securing data
Beyond Hadoop: How tools like Cassandra, Storm, Accumulo, Kafka and Spark fit in the data science toolkit
Hadoop in Action: Real-world case studies of the Hadoop ecosystem in action
The Hadoop Platform: A deep dive into the dominant big data stack, with practical lessons and integration tricks
Deploying and Evaluating Data Products - Josh Levy
Design & Interfaces
D3.js Tutorial - D3 For Everyone! - Sebastian Gutierrez - Part 1
D3.js Tutorial - D3 For Everyone! - Sebastian Gutierrez - Part 2
D3.js Tutorial - D3 For Everyone! - Sebastian Gutierrez - Part 3
D3.js Tutorial - D3 For Everyone! - Sebastian Gutierrez - Part 4
Visual Change: The Power of Scaled Data Visualization in Action - Nathan Shetterley, Joshua Patterson, Allan Enemark, and Kathleen Moynahan
The Future of Storytelling in Data Communication - Andrew Hill
Graphistry: Scaling Visual Exploration with GPUs and Design - Leo Meyerovich
Design and Data, A Human Centered Approach to Analysis, Experiment Design, and Visualization - Arianna McClain and Alisa Lemberg
Visualization Typography: Designing Legends, Labels, Titles, and Text - Trina Chiasson
Hadoop & Beyond
Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin and Helena Edelson - Part 1
Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin and Helena Edelson - Part 2
Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin and Helena Edelson - Part 3
Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin and Helena Edelson - Part 4
Tackling Data Curation in Three Generations - Michael Stonebraker
Advantages of a Domain-Specific Language Approach to Data Transformation - Joe Hellerstein and Sean Kandel
Stories from the Trenches: The Challenges of Building an Analytics Stack - Fangjin Yang and Xavier Léauté
Tachyon: A Memory Centric Storage System for Big Data Computing - Haoyuan Li
Anomaly Detection with Apache Spark - Sean Owen
Mixing Structured Data and Analytics with Spark SQL - Michael Armbrust
Interactive Visual Data Exploration with Spark - Hossein Falaki
Open Source Real Time BI using Storm, Hadoop, Titan, Druid & D3 - Anil Madan
Highly Scalable Tile-Based Visualization for Exploratory Data Analysis - David Jonker and Rob Harper
Hadoop Platform
Building A Data Platform - Stephen O’Sullivan, John Akred, and Richard Williamson - Part 1
Building A Data Platform - Stephen O’Sullivan, John Akred, and Richard Williamson - Part 2
Building A Data Platform - Stephen O’Sullivan, John Akred, and Richard Williamson - Part 3
Building A Data Platform - Stephen O’Sullivan, John Akred, and Richard Williamson - Part 4
From Raw Data to Analytics with No ETL - Marcel Kornacker and Lenni Kuff
SQL on Everything, in Memory - Julian Hyde
From Oracle to Hadoop - Guy Harrison, David Robson, and Kathleen Ting
Hive on Apache Tez: Benchmarked at Yahoo! Scale - Mithun Radhakrishnan
Scaling Storm: Cluster Sizing and Performance Optimization - P. Taylor Goetz
Building Real-time Data Products at LinkedIn with Apache Samza - Martin Kleppmann
HBase: Where Online Meets Low Latency - Nick Dimiduk and Nicolas Liochon
Apache HBase Application Archetypes - Jonathan Hsieh and Lars George
Hadoop Operations - Best Practices from the Field - Chris Nauroth and Suresh Srinivas
Resource Management with YARN - Anubhav Dhoot
Bulk Loading Your Big Data into Apache HBase, a Full Walkthrough - Jean-Daniel Cryans
An Independent Comparison of Open Source SQL-on-Hadoop - Greg Rahn
Bringing PyData to Impala - Uri Laserson
Hadoop in Action
Architectural Considerations for Hadoop Applications - Mark Grover, Jonathan Seidman, Gwen Shapira, and Ted Malaska - Part 1
Architectural Considerations for Hadoop Applications - Mark Grover, Jonathan Seidman, Gwen Shapira, and Ted Malaska - Part 2
Architectural Considerations for Hadoop Applications - Mark Grover, Jonathan Seidman, Gwen Shapira, and Ted Malaska - Part 3
Architectural Considerations for Hadoop Applications - Mark Grover, Jonathan Seidman, Gwen Shapira, and Ted Malaska - Part 4
Getting Started with HBase Application Development - Sridhar Reddy and Carol McDonald - Part 1
Getting Started with HBase Application Development - Sridhar Reddy and Carol McDonald - Part 2
Getting Started with HBase Application Development - Sridhar Reddy and Carol McDonald - Part 3
Getting Started with HBase Application Development - Sridhar Reddy and Carol McDonald - Part 4
How Goldman Sachs is Using Knowledge to Create an Information Edge - Peter Ferns
Customer Intelligence: Harnessing Elephants at Transamerica - Stephen Lloyd, Vishal Bamba, and David Beaudoin
Transitioning from Original Big Data to the New Big Data: L.L.Bean’s Journey - Chris Wilson and Doug Bryan
Unlocking Big Data at CERN - Matthias Braeger and Manish Devgan
Big Data Modeling: How FICO is Turning DBAs into Data Engineers - Lelanie Moll, Deb Brooks, and Silaphet Mounkhaty
How LinkedIn Democratizes Big Data Visualization - Praveen Neppalli Naga, Chi-Yi Kuan, and Jonathan Wu
Better Care with Big Data: A Panel Discussion - Ryan Goldman, Ryan Brush, Sabrina Dahlgren, Aashima Gupta, and Michael Thompson
Renaissance in Medicine: Next-Generation Big Data Workloads - Allen Day
Image Processing on Hadoop - Ailey Crow
The Next Generation of Big Data in the Cloud - Daniel Weeks
Building an Enterprise Data Hub to Bridge the Gap Between Business and IT - Sabrina Dahlgren and Rajiv Synghal
Law, Ethics & Open Data
Better Accountability Through Open Data - Merici Vinton and Micheál Keane
Wonk, Meet Geek - Jim Adler
You Have Zero Privacy, You Own Your Data, and Other Myths - Gilad Rosner
Homelessness Prevention by the Numbers - Stefan Heeke and Adeen Flinker
Why Big Data Needs Thick Data - Tricia Wang and Matt LeMay
Machine Data
Connectivity, Real-Time Data, and Edge Analytics to Enable Intelligent Machines for the Industrial Internet - Alisher Maksumov and Jean Lau
Data is a Local Problem - Alasdair Allan
Super Simple Internet of Things Backend: Persistence Post Hadoop with Crate Data - Jodok Batlogg
SmartCity StreamApp: An Internet of Things Service for Real-time Traffic Management - Damian Black
Security
Resolving Data Inaccuracy - Mike Armstrong
Big Data vs Zombies: Using Algorithms, Big Data, and Large Scale Distributed Processing to Combat Identity Fraud - Jesse Shaw
Why Should Anyone Care at All about Privacy, Privacy Engineering, or Data? - Michelle Dennedy
Real-Time Cyber Threat Detection with Sqrrl and Spark - Adam Fuchs
Big Data Framework for Anomaly Detection & Root Cause Analysis on Streaming Time Series Data - Roy Singh
Enterprise Adoption
In the Data Lake - Barry Devlin
Unseating the Giants - Monte Zweben
What’s Holding Up Your Hadoop? - Eddie Garcia
Spark Camp
Spark Camp - Paco Nathan and Patrick Wendell - Part 1
Spark Camp - Michael Armbrust - Part 2
Spark Camp - Joseph Bradley - Part 3
Spark Camp - Tathagata Das - Part 4
Spark Camp - Sameer Farooqui and Holden Karau - Part 5
Spark Camp - Sameer Farooqui and Holden Karau - Part 6
Spark Camp - Sameer Farooqui and Holden Karau - Part 7
Spark Camp - Sameer Farooqui and Holden Karau - Part 8
Hardcore Data Science
Doing the Impossible (Almost) - Ted Dunning
Tupleware: Redefining Modern Analytics - Tim Kraska
Data Science for Humans, Not Robots - Alice Zheng
Big Data: Efficient Collection and Processing - Anna Gilbert
Computational Problems in Managing Social Information - Jon Kleinberg
Small Data Problems - Kira Radinsky
Building and Deploying Large-scale Machine Learning Pipelines Using the Berkeley Data Analytics Stack - Ben Recht
Learning About Music and Listeners - Brian Whitman
Statistical Topic Modeling - Hanna Wallach
The Aha! Moment: From Data to Insight - Dafna Shahaf
Data-Driven Business Day
Designing for Interruption - Alistair Croll
Check Your Bias, Feed Your Empathy - Farrah Bostic
The Data Lake Dream - Edd Dumbill
Why Marketing’s Approach to Big Data is All Wrong - Jennifer Zeszut
Bigger is Better, but at What Cost? Towards Understanding the Economic Value of Data - Brian d’Alessandro
The Sounds of (Data) Silence - Jana Eggers
Panel: Deciding Better - Joe Caserta, Farrah Bostic, and Halle Tecco
Making Strategic Decisions: Business Requirements for Analytics Projects - Joy Beatty
The Future of Data - Kim Rees
How Goldman Sachs is Using Knowledge to Create an Information Edge - Peter Ferns
The Big (Data) Picture - Rohit Jain
Improving Healthcare Business Strategies through Lean Data Partnerships - Brigitte Piniewski
Building with Data: Lessons from Etsy - Nellwyn Thomas
Reducing Employee Turnover by 75%: Applying Data and Predictive Analytics to Hiring and Team Assembly - Michael Rosenbaum
Better Accountability Through Open Data - Merici Vinton
The Unit: Building Data Science Teams the Special Operations Way - Amy Gaskins
MapReduce ETL Processing for Healthcare Process Improvement Dashboards - Mary Ann Wayer
Industrial Internet
Industrial Internet Day Opening Remarks - Jon Bruner
Taking the Industrial Internet to the Ends of the Earth - Daniel Koffler
Oceans 2.0: The Last Remaining Wild West - Ami Daniel
Big Data Analytics: Enabling Innovation while Reducing Risk - David Simchi-Levi
Video Analytics in the Big Fast Streaming Data Era - Victor Fang and Yu Cao
The Industrial Internet and the Data Revolution - Nathan Oostendorp
Bring Your Own Internet (of Things) - Alasdair Allan
IIOT Applied: 10 Things I Learned While Deploying an IIoT Machine Learning System - Cameron Turner
Industrial Internet Day Closing Panel - Jon Bruner, Leo Spiegel, Edy Liongosari, and Mark Grabb
PyData at Strata
IPython - Brian Granger and Fernando Pérez
Collaborative Data Science with coLaboratory - Kayur Patel and Kester Tong
Intro to NumPy and matplotlib - Jake Vanderplas - Part 1
Intro to NumPy and matplotlib - Jake Vanderplas - Part 2
Introduction to Machine Learning with IPython and scikit-learn - Olivier Grisel - Part 1
Introduction to Machine Learning with IPython and scikit-learn - Olivier Grisel - Part 2
Visualizing Data with Blaze and Bokeh - Andy Terrel
Interactive Visualization with Bokeh - Peter Wang
SciPy – An Exploration of the Most Useful Bits - Travis Oliphant - Part 1
SciPy – An Exploration of the Most Useful Bits - Travis Oliphant - Part 2
New and Upcoming Features in Pandas - Wes McKinney
High Performance Python - Trent Nelson
Sponsored
Got the T-shirt: Real Experiences from a Hadoop Veteran - Jim Scott
See the Fastest Spark-Powered Disparate Data Blending Analysis Solution - Vaibhav Nivargi
Disrupting the Traditional Analyst Workflow with Platfora and Spark - Peter Schlampp and Ed Smith
Big Data Architectural Patterns - Todd Papaioannou
An End-to-End Approach to Offloading the Data Warehouse with Hadoop - Jorge A Lopez
Global Hadoop: Storage and Compute Challenges in Multi-Data Center Deployments - Jagane Sundar and Brett Rudenstein
Using Graph to Discover Unseen Relationships in Big Data - Mike Hoskins
Hadoop Effortlessly: A Data Inventory is Key to Data Self-service - Moderated by: Alex Gorelik - Panelists: Suresh Srinivas, Mike Sutten, John Mount, Clark Farrey, and Sunil Soares
Building Real-Time Platforms with MemSQL and Apache Spark - Eric Frenkiel
Unlocking Hadoop’s Potential with YARN - Sanjay Radia
Real-time streaming and analytics with Amazon Elastic MapReduce and Amazon Kinesis - Steve McPherson
NoSQL Solutions for Big Data Problems - Don Pinto
Big Data SQL and Query Franchising: An Architecture for SQL Beyond Hadoop - Dan McClary
Drive Data Quality at Your Company: Create a Data Lake - George Corugedo
Important Advances in Hadoop: A Panel Discussion - Joey Jablonski, Armando Costa, Jim Burmingham, and Rob Johnson
Cloud Machine Learning - Joseph Sirosh
Embracing Diversity - Sid Sipes
The Art of Prediction: Seamless Visualization and Modeling With Hadoop - Adam Pilz
Extending “Variety” of Data to “Variety” of Users - Tina Groves
How to Architect Big Data Apps with the Lambda Architecture - with Real Work Examples on Merging Batch and Real-Time Processing - Altan Khendup and Ron Bodkin
What do Al Capone & Hadoop Have in Common? Visualizing Data at Scale – Making Sense Out of Big Data - James Dixon
Distributed R - A Scalable and High-performance Platform for R - Sunil Venkayala and Indrajit Roy
Getting Big Data to Work: Agile Data Transformation in Hadoop - Stephanie McReynolds, Xavier Quintuna, Shirshanka Das, Charlie Crocker, and Anna Dorofiyenko
Now Playing at Netflix: Advanced Decision-Making with Hadoop, Starring MicroStrategy - Michael Hiskey
Analytics the Way Nature Intended - Donald Farmer
Western Union: Implementing a Hadoop-based Enterprise Data Hub with Informatica - Pravin Darbare and Sumeet Agrawal
For Red Hat, it’s 1994 All Over Again - Sarangan Rangachari
Hadoop Responsibly with Big Data Governance - Moderated by: Barry Devlin - Panelists: Sunil Soares, Joseph Dossantos, and Jay Zaidi
Big Content: Finding the Why Behind the What - Sid Probstein
Solutions Showcase Theater
Innovative Healthcare, Tech & Retail Companies Mix CRM Info with Big Data to Make Reps 10x More Productive, 40x More Useful and 30% More Profitable - Michael Hiskey
Real-time Classification and Sentiment Analysis of Multi-lingual Content Using Advanced Analytics on Apache Storm - Anand Venugopal
Hadoop at Bloomberg - Sudarshan Kadambi
EVP Data Lake: Store Everything, Analyze Anything, Build What You Need - Ryan Peterson
10 Amazing Things to do With A Hadoop-based Data Lake - Greg Chase
Solve Data Ingest Limitation with High Performance Networks Offloads - Asaf Wachtel
Real-Time Big Data Architecture @ LivePerson - Shane K. Johnson
From Infrastructure to Data Applications - Jonathan Gray
From Big Iron to Big Data: Offloading Data Workloads to Hadoop at a Major US Bank - Jorge A. Lopez
Managing Data in Regulated Industries - Jim Clark
The Pain Curve - Lack of Automation Leads to Failure - Greg Bruno
Building the Enterprise Data Hub - Joe Caserta
QlikView and Big Data Analytics at King - Donald Farmer
Driving Growth in Transportation Using Big Data and Data Science - Marie Goodell
Competitiveness in the Age of Big Data - Satyendra Rana
Unraveling Hadoop’s Meltdown Mysteries - Sean Suchter
Let’s Stop Pretending that One Size Fits All When it Comes to the Challenges of Working with Enterprise Data - Nenshad Bardoliwalla
Waking Analysts from their Nightmare - George Corugedo
“Mining” the IoT for Business Value: How WWT Helped One of the Largest Mining Companies Predict Engine Failures - Yoni Malchi
All Hands on Deck: How to Get Non-technical Business Users to Tackle Big Data so you Can Focus on Complex Queries - Amit Bendov
The Spark-Inspired Workflow - Kevin Beyer
Do you Prefer to Hike up Machu Picchu or Take the Train? - Todd Goldman
Using Big Data to Improve Patient Outcomes - John Armstrong
Get Real with Hadoop - Jim Scott
Big Data Analytics Heavyweight Sounds Off on Financial Services Use Cases - Matt Schumpert
Real World Showcase of How a Retail Customer Uses and Can Use Microsoft Big Data and Business Analytics Technologies - Sanjay Soni
Using Hadoop to Run Real-Time, Operational Applications - Rich Reimer
Automated Data Inventory for Hadoop - Oliver Claude
Keys to Optimizing Product Inventory and Pricing at One of the Largest Global Retailers - Julien Sauvage
Consumer Behavior Analytics with Cubes on Hadoop - Ajay Anand
Omneo’s Enterprise Data Hub: Helping Manufacturers Save Millions - Kathleen deValk
Building an Enterprise Grade Big Data Risk Management Solution for Financial Services - Vamsi Chemitiganti
Orange Silicon Valley spins up private Big Data as a Service with BlueData to create on-demand Spark and Hadoop Clusters - Tom Phelan
Everything You Don’t Know About HBase in 10 Minutes or Less - Alex Newman
Big Data News Cases… What in the World are People Doing with Hadoop? - Gord Sissons
Build Intelligent Applications with H2O’s Open Source - Joel Horwitz
NoSQL Key Value Stores - The Key to Velocity - Brian Bulkowski
Java Big Data in Real Time - Matt Schuetze
Using Operational Intelligence to Track 10M Cable TV Viewers in Real Time - Dr. William Bain
Unlock the Value of Big Data with Hunk for Hadoop - Adrish Sannyasi
Big Cybersecurity Data for Insider Threat Analysis - Joe Travaglini
Customer Spotlight: Big Data, The Elephant and the Bear - Lawrence Schwartz
Case Study: Improving Customer Experience by Employing Big Data Technologies in the Banking Industry - Martin Triska
Better Manufacturing with Data: Using 3D Visual Analytics on the Shop Floor - Carl Byers
MemSQL & Shutterstock: Insights in Real Time - Eric Frenkiel and Chris Fischer
Running In-Memory Jobs and Traditional Jobs on the Same Hadoop Cluster - David Chaiken
Data Transformation on Hadoop: Balancing Technology and Human Needs to Boost Performance and Increase ROI - Ravi Hubbly
Big Data Analytics / IoT: New Customer Insights Using Network Data - Ankur Gupta
Extending Enterprise Data Security to Hadoop - Raul Ortega
Industrialized Hadoop Analytics and SQL: Unleashing the Business User - John Santaferraro
Connection Analytics: Extracting Value from Social Networks Data - Sri Raghavan
The Emergence of the Streamlined Data Refinery - Chuck Yarbrough
Hardware Still Matters: Manageable Infrastructure Platforms for Dynamic Big Data Environments - Robert Novak