Video description
The future clearly belongs to those who understand how to collect and use their data successfully—and the time to get started is now. At the Strata Conference, people from tech, marketing, and many other fields gathered to learn the latest skills, tools, and technologies for making a data-driven business work. This video compilation offers you a front row seat for every tutorial, session, and keynote at the conference.
View thought-provoking keynotes from industry leaders such as Avinash Kaushik (Market Motive), Coco Krumme (MIT Media Lab), Dave Campbell (Microsoft), and Doug Cutting (Cloudera). Then sit back and take in practical and inspiring sessions across tracks including Data Science, Business & Industry, Visualization & Interface, Hadoop & Big Data, Policy & Privacy, and Domain Data.
You’ll also get the complete Strata Jumpstart, a day-long bootcamp for business leaders who want to become data driven. Download these videos or view them through our HD player, and discover how the world of big data can—and will—affect your organization.
Here are just a few of the sessions you’ll receive in this video package:
Data Science:
- The Two Most Important Algorithms in Predictive Modeling Today—Jeremy Howard (Kaggle), Mike Bowles (Sole Proprietor)
- Architecting Virtualized Infrastructure for Big Data—Richard McDougall (VMware)
Business & Industry:
- Data Jujitsu: The Art of Turning Data into Product—DJ Patil (Greylock Partners)
- Improving Productivity Using Real-Time Data—Jacomo Corbo (QuantumBlack)
Visualization & Interface:
- Science of Visualization—Jock Mackinlay (Tableau Software)
- Roll Your Own Front End: A Survey of Creative Coding Frameworks—Michael Edgcumbe (Columbia University), Eric Mika (The Department of Objects)
Hadoop & Big Data:
- I Didn't Know You Could Do All that with Hadoop—Jack Norris (MapR)
- Storm: distributed and fault-tolerant realtime computation—Nathan Marz (Twitter)
Domain Data:
- Understanding Social Contagion—Marcel Salathé (Penn State University)
- Changing Data Standards from Wall Street to DC and Beyond—John Mulholland (Fannie Mae)
Table of Contents
Strata Conference 2012: Day 1
SQL and NoSQL Are Two Sides Of The Same Coin
From Knowing What To Understanding Why
The Model and the Train Wreck: A Training Data How-to
Corpus Bootstrapping with NLTK
The Importance of Importance: An Introduction to Feature Selection
Social Network Analysis Isn’t Just For People
Array Theory vs. Set Theory in Managing Data
Survival Analysis for Cache Time-to-Live Optimization
The Data Science Debate
Introduction to Apache Hadoop Part 1
Introduction to Apache Hadoop Part 2
Introduction to Apache Hadoop Part 3
Introduction to Apache Hadoop Part 4
The Two Most Important Algorithms in Predictive Modeling Today Part 1
The Two Most Important Algorithms in Predictive Modeling Today Part 2
The Two Most Important Algorithms in Predictive Modeling Today Part 3
The Two Most Important Algorithms in Predictive Modeling Today Part 4
Large scale web mining Part 1
Large scale web mining Part 2
Large scale web mining Part 3
The Craft of Data Journalism Part 1
The Craft of Data Journalism Part 2
The Craft of Data Journalism Part 3
The Craft of Data Journalism Part 4
Big Data Without the Heavy Lifting Part 1
Big Data Without the Heavy Lifting Part 2
Big Data Without the Heavy Lifting Part 3
Big Data Without the Heavy Lifting Part 4
Big Data Entity Extraction With Less Work and Less Code Part 1
Big Data Entity Extraction With Less Work and Less Code Part 2
Big Data Entity Extraction With Less Work and Less Code Part 3
Big Data Entity Extraction With Less Work and Less Code Part 4
Introduction to R for Data Mining Part 1
Introduction to R for Data Mining Part 2
Introduction to R for Data Mining Part 3
Introduction to R for Data Mining Part 4
Building Applications with Apache Cassandra Part 1
Building Applications with Apache Cassandra Part 2
Building Applications with Apache Cassandra Part 3
Building Applications with Apache Cassandra Part 4
Hadoop Data Warehousing with Hive Part 1
Hadoop Data Warehousing with Hive Part 2
Hadoop Data Warehousing with Hive Part 3
Hadoop Data Warehousing with Hive Part 4
Hands-on Visualization with Tableau Part 1
Hands-on Visualization with Tableau Part 2
Hands-on Visualization with Tableau Part 3
Hands-on Visualization with Tableau Part 4
Designing Data Visualizations Workshop Part 1
Designing Data Visualizations Workshop Part 2
Designing Data Visualizations Workshop Part 3
Designing Data Visualizations Workshop Part 4
Developing applications for Apache Hadoop Part 1
Developing applications for Apache Hadoop Part 2
Developing applications for Apache Hadoop Part 3
Developing applications for Apache Hadoop Part 4
What Marketers Can Learn From Analysts
Jumpstart Welcome
Big Data and Supply Chain Management: Evolution or Disruptive Force?
Ammunition for the CFO: How to be a Hard-Nosed Business Customer for Analytics
3 Essential Skills of a Data Driven CEO
Business Intelligence: What have we been missing?
Do it Right: Proven Techniques for Exploiting Big Data Analytics
The Business of Big Data
Big Data, Serious Games, and the Future of Work
It’s Not Just About the Data… the Power of Driving Impact Through Intent and Interconnectedness
Wrap-up Session
Strata Conference 2012: Day 2
The Apache Hadoop Ecosystem
Decoding the Great American ZIP myth
Guns, Drugs and Oil: Attacking Big Problems with Big Data
Machine Learning and Big Data: Sustainable Value or Hype?
Learning Analytics: What Could You Do With Five Orders of Magnitude More Data About Learning?
A Big Data Imperative: Driving Big Action
The Information Architecture of Medicine is Broken
Do We Have The Tools We Need To Navigate The New World Of Data?
Street Fighting Data Science
Data Ingest, Linking, and Data Integration via Automatic Code Generation
Disambiguation: Embrace wrong answers and find truth
Netflix recommendations: beyond the 5 stars
Data Science in Product Development
Mo’ Data, Mo’ Problems
Business Management Strategies for Big Data
Becoming a Data-Driven Organization
Building a Data Strategy: Data Enabling Toys at Leapfrog
Analytics in a Community-Driven Fashion Retailer
Data Science in Marketing Analytics
Science of Visualization
Effective Data Visualization
Building a Data Narrative: Discovering Haight Street
Crafting Meaningful Data Experiences
Roll Your Own Front End: A Survey of Creative Coding Frameworks
Sketching With Data
The Future of Hadoop: Becoming an Enterprise Standard
Hadoop + JavaScript: what we learned
Architecting Virtualized Infrastructure for Big Data
Aggregating and serving local places data and ads at Citygrid
Exploring Social Data: Use Cases for Real-World Application
Understanding Social Contagion
Changing Data Standards from Wall Street to DC and Beyond
Big Data: Wall Street Style
Big Data = Bigger Metadata
Linked Data: Turning the Web into a Context Graph
Data as a Strategic Weapon - Walmart, Netflix and Apigee Panel Discussion
Creating Real Business Value with Big Data Analytics
Getting the Most from Your Hadoop Big Data Cluster
Amazon DynamoDB: A seamlessly scalable NoSQL service
Turning Big Data Into Competitive Advantage
Unleash Insights On All Data With Microsoft Big Data
SQLFire - An Ultra-fast, Memory-optimized Distributed SQL Database
MapReduce for the Rest of Us: Unlocking Data Science for the Business User
Automated Understanding - The Next Evolution in Big Data Analytics
RHadoop, R meets Hadoop
Monitoring Apache Hadoop - a big data problem?
How to develop Big Data Pipelines for Hadoop
How Crunch Makes Writing, Testing and Running of MapReduce Pipelines Easy, Efficient and Even Fun!
Analyzing Hadoop Source Code with Hadoop
Strata 2012 Startup Showcase
Strata Conference 2012: Day 3
Democratization of Data Platforms
5 Big Questions about Big Data
The Trouble with Taste
Embrace the Chaos
Open Data and the Internet of Things
Big Data’s Next Step: Applications
Heritage Provider Network Announces the Winner of the Second Heritage Health Progress Prize
Using Google Data for Short-term Economic Forecasting
Is this normal? Finding anomalies in real-time data
From Predictive Modeling to Optimization: The Next Frontier
Mining Unstructured Data: Practical Applications
Migratory data: the distributed data you carry with you
Humans, Machines, and the Dimensions of Microwork
Big Data and Bibliometrics: Crowdsourcing the World’s Largest Database of Research
Democratizing BI at Microsoft: 40,000 Users and Counting
Mining the Eventbrite Social Graph for Recommending Events
Data Jujitsu: The Art of Turning Data into Product
Data Marketplaces for your extended enterprise: Why Corporations Need These to Gain Value from Their Data
Big Data Meets Big Weather
Improving Productivity Using Real-Time Data
Video Graphics - Engaging and Informing
Rich Sports Data and Augmented Reality
Visualizing Geo Data
Beautiful Vectors: Emerging Geospatial technologies in the browser
From Big Data to Big Insights
Exploring the Stories Behind the Data
Hadoop Analytics in Financial Services
Using Map/Reduce To Speed Analysis of Video Surveillance
Beyond Map/Reduce: Getting Creative With Parallel Processing
Petabyte Scale, Automated Support for Remote Devices
Big Analytics Beyond the Elephants
If Data Wants to Be Free, is Privacy a Prison?
Pretty Simple Data Privacy
OODA Loop: How to Understand the Use Cases for Big Data
It’s Not Junk [Data] Anymore
Big Data for the Common Good
Personalized Medicine and Individual Cancer Care, it is a data problem
Solving big data analytics with an emerging data-centric language
Big Data and Machine Learning: A Reality Check
Big Data Big Costs?
Big Data Meets the Big Cloud: How To Monitor Thousands of Servers
Big Data and the Social Firehose
Big Data Applications in Action
Start Innovating! Crowdsourcing and Big Data
Apache Cassandra: NoSQL Applications in the Enterprise Today
Storm: distributed and fault-tolerant realtime computation
Analytics from 330 million smartphones
Connecting Millions of Mobile Devices to the Cloud
Open Source Ceph Storage: Scaling from Gigabytes to Exabytes with Intelligent Nodes
Mapping social media networks (with no coding) using NodeXL