Schedule: Operations sessions

Listed below are the preliminary Operations sessions and workshops confirmed to date. Additional sessions will be added as they are confirmed. Schedule subject to change.

Add to your personal schedule
Ballroom EFGH
Ron Pedde (Rackspace Hosting), Todd Willey (OpenStack), Matt Ray (Opscode)
Average rating: ***..
(3.50, 10 ratings)
OpenStack is a collection of open source technologies delivering a massively scalable cloud operating system. Read more.
Add to your personal schedule
Mission City
Adrian Cockcroft (Battery)
Average rating: ****.
(4.57, 28 ratings)
Details of why Netflix is running on AWS, what is running where and how it was done. Lot's of practical details on re-architecting large scale web services to run in the cloud, and new operational patterns. Read more.
Add to your personal schedule
Ballroom EFGH
Joshua Timberman (Opscode, Inc.), Adam Jacob (Chef), Christopher Brown (Opscode), Aaron Peterson (Opscode), Matt Ray (Opscode), Seth Chisamore (Opscode, Inc.)
Average rating: ***..
(3.60, 15 ratings)
System administrators love Chef because it gives them flexibility to integrate all aspects of their infrastructure such as monitoring and trending tools with applications. Software developers love Chef because it helps them take care of the muck so they can focus on writing great applications. Read more.
Add to your personal schedule
Mission City
Philip (Flip) Kromer (Infochimps, a CSC Big Data Business), Dennis Yang (Infochimps)
Average rating: *....
(1.96, 25 ratings)
Chef isn't just 'shell scripts but better', and Flume isn't just 'manages logs'. These are first-class superpowers for radically decoupling systems, and the more you decouple, the more benefits beyond the system diagram emerge. I'll show how these tools let you rethink team structure, hand interns the keys to the Ferrari, ignore every best practice in sight, and scale on a shoestring. Read more.
Add to your personal schedule
Ballroom EFGH
Philip Zeyliger (Cloudera)
Average rating: ***..
(3.36, 14 ratings)
What new challenges does managing and operating distributed systems pose? How is managing distributed systems fundamentally different than managing single-node machines? In this talk, we discuss the problems and work through the architecture of a system that eases lifecycle and configuration management of HDFS, MapReduce, HBase, and a few others. Read more.
Add to your personal schedule
Mission City
Jeff McCune (Puppet Labs)
Average rating: ***..
(3.20, 10 ratings)
Traditional methods of monitoring service performance have shortcomings due to the on-demand nature of cloud computing. Learn how to deploy MCollective to provide a self-organizing service monitoring and remediation. Read more.
Add to your personal schedule
Mission City
Jake Loomis (Yahoo!)
Average rating: ****.
(4.23, 13 ratings)
This year the Yahoo! frontpage went dark. Tweets rang out, "What happened? Yahoo! is never down. I can't remember the last time this happened." What enabled www.yahoo.com's remarkable record of reliability? This session will cover the top 5 tricks that contributed to the stability of Yahoo!'s frontpage and a description of what caused the eventual downtime. Read more.
Add to your personal schedule
Mission City
Robert Treat (OmniTI)
Average rating: ***..
(3.00, 6 ratings)
It's great to talk about bringing dev tools to ops teams, or making developer's more aware of production issues, but your database is where the rubber meets the road. Join me in this talk where I'll outline common strategies used in many of todays leading devops shops to keep things running. It's not just about dba's anymore. Read more.
Add to your personal schedule
Mission City
Justin Sheehy (Basho Technologies)
Average rating: ***..
(3.50, 10 ratings)
We will face the reality that anything we depend on can fail, and in the worst possible combination. We will learn how to build systems that let us happily sleep at night despite this troubling realization. Read more.
Add to your personal schedule
Mission City
Gene Kim (IT Revolution Press)
Average rating: ***..
(3.89, 9 ratings)
I'm going to share my top lessons of how great IT organizations simultaneously deliver stellar service levels and fast flow of new features into production. It requires creating a "super-tribe", where development, test, IT operations and information security genuinely work together to solve business objectives as opposed to throwing each under the bus. Read more.
Add to your personal schedule
Mission City
Mark Imbriaco (GitHub)
Average rating: ****.
(4.25, 12 ratings)
The Heroku platform grew from hosting 0 applications to more than 120,000 without ever buying a server. We'll talk about the overall architecture of the platform and dive into some of the operational consequences of those architectural choices. Read more.
Add to your personal schedule
Ballroom ABCD
Jason Cook (Wikia)
Average rating: ***..
(3.89, 9 ratings)
In the pursuit of making our site fast, efficient, and resilient in the face of failure we have chosen to use geographically dispersed data centers. In this session I will cover the tools and techniques we used to extend MediaWiki to render pages out of multiple geographically dispersed data centers. Read more.
Add to your personal schedule
Ballroom EFGH
Kate Matsudaira (SEOmoz)
Average rating: ****.
(4.11, 9 ratings)
What do you do when everything falls apart during your product launch? This talk will cover tools for surviving the worst, and making the most of it. We'll look at post-launch checklists to help recover rapidly, managing expectations to instill confidence in your customers and management, keeping up team morale, and successful post-mortems. Read more.
Add to your personal schedule
Ballroom ABCD
Bryan Cantrill (Joyent, Inc.)
Average rating: ****.
(4.93, 15 ratings)
We describe our experiences developing DTrace-based system visualization ofNode.js in an early production environment: the 2010 Node Knockout programmingcontest. We describe the challenges of instrumenting a distributed, dynamic, highly virtualized system -- and what our experiences taught us about the problem, the technologies used to tackle it, and promising approaches. Read more.
Add to your personal schedule
Ballroom ABCD
John Adams (Twitter)
Average rating: ***..
(3.88, 8 ratings)
Interconnecting many systems to express a single function is often a difficult task. What seems like a simple problem, becomes a difficult one when trying to serve millions of users at the same time without failure or downtime. Timeouts, redundancy, humans, and high levels of concurrency are a challenge, and I'll walk through problems and solutions encountered while scaling twitter.com. Read more.
Add to your personal schedule
Ballroom ABCD
Theo Schlossnagle (OmniTI/Circonus)
Average rating: ***..
(3.75, 4 ratings)
We all know that web performance is key. It drives adoption, loyalty and revenues. While it is so important, real-time web performance metrics are often absent from operations culture. Which metrics are important to operations? How can I use these metrics to improve performance, availability and stability? Learn how to make operational sense out of this critical performance indicators. Read more.
Add to your personal schedule
Ballroom ABCD
Mohit Lad (Thousand Eyes Inc)
Average rating: ***..
(3.17, 6 ratings)
How many locations should one serve content from to make their application truly fast globally? At what point does the improvement become marginal? These are questions that we answer in this talk using actual network performance data and arm the audience with the knowledge necessary to make informed decisions on network planning for optimizing application performance. Read more.
Add to your personal schedule
Ballroom EFGH
Jeremy Edberg (Netflix)
reddit.com does 1.3 billion pageviews a month, and that number grows by about 10% each month. Through that growth they've made some mistakes along the way. Some they have fixed, and some they are still paying the price. Come hear Jeremy Edberg, reddit's head of technology, speak about their successes and failures. Read more.
Add to your personal schedule
Ballroom EFGH
Moderated by:
Mandi Walls (Opscode)
Panelists:
Mark Imbriaco (GitHub), Matt Hackett (Tumblr), Teresa Dietrich (WebMD), Jake Loomis (Yahoo!)
Average rating: **...
(2.00, 1 rating)
A panel discussion featuring folks from sites which have had large outages in the past months. Discussion topics would cover causes, resolutions, process changes after the outage, cultural changes after the outage, and lingering after affects with regards to how the userbase reacted. Read more.
  • Keynote Systems
  • Cisco
  • Google
  • Neustar
  • Betfair
  • Cotendo
  • Rackspace Hosting
  • Akamai
  • Apica
  • dynaTrace
  • Equinix
  • Facebook
  • New Relic
  • Opscode
  • Salesforce.com
  • Yahoo! Inc.
  • AppDynamics
  • Aptimize
  • Blaze
  • CDNetworks
  • Cedexis
  • Citrix Systems
  • Compuware Corporation
  • Dyn Inc.
  • F5 Networks
  • Heroku
  • Percona
  • Quest Software
  • Schooner Information Technology
  • SiteSpect
  • Splunk
  • Strangeloop
  • WatchMouse
  • Zeus Technology
  • Neustar

Sponsorship Opportunities

For information on exhibition and sponsorship opportunities at the conference, contact Yvonne Romaine at yromaine@oreilly.com

Download the Velocity Sponsor/Exhibitor Prospectus

Contact Us

View a complete list of Velocity contacts