Decisions in the Face of Uncertainty or Just Enough Statistics to be Dangerous

John Rauser (Amazon)
Web Performance Ballroom ABCD
Please note: to attend, your registration must include Workshops.
Average rating: ****.
(4.04, 47 ratings)

How many machines will we need for Q4? How big should our next datacenter build be? What are the odds this project will be completed on time? What does a network packet loss rate of 1% mean for my application? Do you feel like you ought to have a working knowledge of statistics, but don’t know where to begin? If you took a statistics class in college, but have forgotten most of it, or you’ve never had any formal training in statistics, this class is for you. My goal is to give you a basic toolset to begin reasoning statistically through a series of interactive examples.

We’ll do an exercise in estimation, where you’ll learn how well calibrated you are as an estimator. We’ll use the data we get from the estimation exercise to explore one of the most useful ideas in statistics: the binomial distribution. We’ll conduct a key statistical test, the chi-square test, which use can to tell if the data you’re observing match what you’d expect or if the pattern is different. And last, we’ll solve a problem in decision theory and along the way learn about the normal distribution.

Though I’ll use a little bit of scary looking math, you should still attend if you’ve forgotten all the math you learned in school. I show how to use software like excel and sage to do all the mathematical heavy lifting.

Photo of John Rauser

John Rauser

Amazon

John has been extracting value from large datasets for over 15 years at companies ranging from hedge funds to small data-driven startups to amazon.com. He has deep experience in machine learning, data visualization, website performance and real-time fault analysis. An empiricist at heart, John’s optimism and can-do attitude make “Just do the experiment!” his favorite call to arms.

Comments on this page are now closed.

Comments

Picture of Suzanne Axtell
Suzanne Axtell
06/29/2011 10:31am PDT

Hi Nicolas, the video will be available to All Access and Online Access pass holders very soon.

Picture of Nicolas Poggi
Nicolas Poggi
06/29/2011 2:01am PDT

Wow, this must have been a great presentation, will there be video? from the slides alone it hard to get all the info.

Picture of Aaron Peters
Aaron Peters
06/20/2011 8:48pm PDT

I liked John’s presentation, but I think people could not really get anything practical and useful from it. The formulas were overwhelming. I think John is an awesome speaker and he should be on stage every year, but this was a bit too much ‘theory’

Picture of Sophia DeMartini
Sophia DeMartini
06/20/2011 9:09am PDT

The slides and handouts are all posted now – please leave a comment if you can’t see them for any reason.

-Sophia

Picture of H. "Waldo" Grunenwald
H. "Waldo" Grunenwald
06/20/2011 4:56am PDT

I found this talk highly entertaining, and an excellent intro to Statistics. While I didn’t get as much from this talk as I would have liked, there were some key takeaways (such as that Average is seldom actually useful and to attempt to give a range rather than a single answer). Regardless, the entertainment value was high.

martin wignall
06/19/2011 2:06pm PDT

Fantastic talk. Initially I wondered why I was in a statistics lecture, but actually (being an ex-mathematician anyway) realised that maybe I should dust off the old text books and apply this in my own work! Thank you for this wonderful exposition of a complex subject

Sathya Narayanan Nagarajan
06/17/2011 4:01pm PDT

It was great! I liked the way you presented!!!

Picture of John Rauser
John Rauser
06/17/2011 1:57pm PDT

@Ernest: Good feedback. When I’ve given the talk to smaller audiences with a bit more time there’s more breathing room to work with the audience to address specific problems they have, but in 90 minutes and a giant room, that didn’t happen. Perhaps I’ll prepare another talk that more strongly favors application specific depth.

@Tim: I’ve given the presentation materials to the folks at O’Reilly; I’ll ask to have them posted.

James Schmidt
06/17/2011 1:37pm PDT

John is an excellent presenter. This session gave me much to think about.

Tim Grant
06/17/2011 10:07am PDT

Will you be posting the slides and video of this session? I noticed that it was not in the Video and Slides section of the site… :(

Tim Grant
06/17/2011 10:06am PDT

John did an excellent job of clearly showing how to apply sophisticated statistical analyses to better diagnose complex situations. I wish my college statistics classes as clear cut… Thanks John

Picture of Ernest Mueller
Ernest Mueller
06/15/2011 9:40am PDT

I enjoyed this session; my colleagues and I wished there had been more examples relevant to WebOps/Perf to drive it home how/when you might apply these principles on the job.

Picture of Steve Souders
Steve Souders
05/19/2011 2:42pm PDT

John was one of the highest rated speakers at Velocity 2010. He’s a data mining magician full time at Amazon. Metrics are critical for web performance – without them you’re flying blind. We all know some basics, but John will help everyone step up to the next level at this workshop. This is a must attend.

  • Keynote Systems
  • Cisco
  • Google
  • Neustar
  • Betfair
  • Cotendo
  • Rackspace Hosting
  • Akamai
  • Apica
  • dynaTrace
  • Equinix
  • Facebook
  • New Relic
  • Opscode
  • Salesforce.com
  • Yahoo! Inc.
  • AppDynamics
  • Aptimize
  • Blaze
  • CDNetworks
  • Cedexis
  • Citrix Systems
  • Compuware Corporation
  • Dyn Inc.
  • F5 Networks
  • Heroku
  • Percona
  • Quest Software
  • Schooner Information Technology
  • SiteSpect
  • Splunk
  • Strangeloop
  • WatchMouse
  • Zeus Technology
  • Neustar

Sponsorship Opportunities

For information on exhibition and sponsorship opportunities at the conference, contact Yvonne Romaine at yromaine@oreilly.com

Download the Velocity Sponsor/Exhibitor Prospectus

Contact Us

View a complete list of Velocity contacts