Build resilient systems at scale
May 27–29, 2015 • Santa Clara, CA

Speaker Slides & Video

Presentation slides will be made available after the session has concluded and the speaker has given us the files. Check back if you don't see the file you're looking for—it might be available later! (However, please note some speakers choose not to share their presentations.)

Dustin Whittle (AppDynamics)
Slides:   1-PDF 
The AppDynamics Application Intelligence Platform offers visibility into the workings of distributed applications, with real-time analytics underpinning the data collection. AppDynamics Unified Monitoring encompasses user experience, application, and infrastructure component availability and performance monitoring.
Ross Clanton (Target), Heather Mickman (Target)
Slides:   external link
Target has been focused on (re)building an engineering culture for over a year. One of the most important components has been a focus on a DevOps transformation. This talk will reflect on our learnings over the past year, and approaches we are experimenting with moving forward. We will share tips and insights on driving this kind of change in a large traditional IT organization.
Indi Young (myself)
Slides:   1-PPTX 
Empathy is a popular buzzword that gets equal positive and negative attention in business media. The reason for the debate is that empathy is not well defined. People with budget responsibility need reliable knowledge, not 'increased sensitivity.' Fortunately, _cognitive_ empathy represents a practical, repeatable approach to cultivating insight.
Dave McCrory (Basho Technologies)
Slides:   1-PPTX 
In this session, Dave McCrory, CTO of Basho Technologies, will discuss steps organizations can take to solve integration and operational challenges for active workloads, passive workloads, and backend analysis, as well as how innovations in distributed systems can simplify today’s complex management systems.
Eddie Canales (Manta Media)
Slides:   external link
Have you ever had the experience of trying to speed up your pages, then realizing that you're only responsible for 13 of the 187 HTTP requests? It's like getting ready to tune up your drag car, then walking around the back to see that the parachute can't be put back in.
Vanessa Meyer (Load Impact)
Slides:   1-PDF 
Many see APIs as just a technical concept and fail to appreciate their rising strategic business significance. APIs allow companies to grow at unprecedented rates, and help many to expand into new markets. From a business perspective, APIs are a new tool in the product marketer's toolbox. That may mean developers will increasingly take orders from marketing.
Jon Correll (Conversion Voodoo / VoodooAlerts / Maxly)
Slides:   1-PPTX 
CRO (conversion rate optimization) is no longer just a front-end UI buzzword, yet most companies are still failing to bridge the gap between tech and marketing teams to provide the user with the best end-to-end experience. Learn how to break down the walls between your teams and bring a culture of optimization to your entire business.
Shane Evans (Hewlett-Packard)
Slides:   1-PPTX 
Join HP as Shane Evans explores the keys to succeeding in the enterprise, and which tools, methodologies, and strategies are needed to bring the enterprise into the 21st century.
Laine Campbell (Pythian)
Slides:   1-PDF 
Operational visibility is more than simply monitoring and graphing. In this tutorial, we will discuss the theory and execution of this key pillar of operational excellence, from business requirements to user story to collection, analysis, storage, and visualization. Additionally, we will be sharing our easily deployed, open source OpsViz stack, available for AWS CloudFormation and accessible on GitHub.
Alastair Simpson (Atlassian)
Slides:   external link
Combine growth hacking, user research, data analytics, and A/B testing at scale to quickly optimize customer and prospect experiences. No need to wait for development teams or interrupt their schedules. Empowered by data, growth teams can make early assumptions about customer needs, build minimum viable experiences, and then run usability and growth experiments to validate those assumptions.
Amiya Gupta (Microsoft)
Slides:   external link
MSN.com shipped a major new release last fall. This talk covers the design principles, challenges encountered, and optimization techniques applied while chasing the goal of great perceived performance. Topics include layout thrashing, navigation and resource timings, responsive design, custom fonts and more.
Ralph Bodenner (New Relic)
Slides:   external link
For an engineering team to sustain its culture as it grows, even more important than what processes it adopts is how those processes undergo change. At New Relic, we have a culture of openness that we intend to keep, so we've been using the same process to change our culture that we use to change our code: GitHub Pull Requests. Learn why and how we opened up our Engineering Handbook.
Seth Vargo (HashiCorp)
Slides:   1-PDF 
Consul by HashiCorp is an open source tool for service discovery, monitoring, and infrastructure configuration. Simple configuration and powerful features like high availability, failure detection, and multi-datacenter awareness make Consul a great solution for organizations of all sizes trying to scale their monitoring.
Sunil Shah (Mesosphere), Jose Armando Garcia (Mesosphere)
Slides:   external link
In this hands-on workshop, you will learn how to build a multi-container application using the Datacenter Operating System (DCOS). DCOS allows easy deployment and management of datacenter workloads, while significantly increasing utilization. In this workshop, we’ll cover how to deploy typical containerized service and batch workloads across a cluster of machines using the DCOS.
Slides:   external link
Etsy has recently developed new alerting tools to help discover and dig into performance regressions across the site. Allison McKnight, performance engineer at Etsy, will cover how we built these tools on top of existing technology, how we iterated on the context included in our alerts, and how these tools have changed the way that we collaborate with other teams to fix performance regressions.
Laine Campbell (Pythian)
Slides:   1-PDF 
IT is undergoing a revolution, and database administration is no exception. As Ops teams evolve into reliability engineers, developers and traditional systems administrators find themselves diving into the world of the DBA. In this session, we take the model of site reliability engineering and guide you through the components of the craft of DB architecture/operations in that context.
Sheldon Monteiro (SapientNitro)
Slides:   1-PDF 
Marketing and IT need each other, and in successful organizations, a hybrid technologist is “minding the gap.” We will decode the DNA of the marketing technologist - an activist engineer - part creative, part marketer, part ad-woman, solid technologist, with visionary leadership and influence skills, who drives change at the intersection of story and technology.
Steve Souders (SpeedCurve)
Slides:   external link
Users want a fast website with a rich design, but sometimes the interplay between design and performance feels like a fixed sum game: One side's gain is the other side's loss. Design and performance are indeed connected, but it's more like the yin and yang. They aren't opposing forces, but instead complement each other. Bringing these processes together produces experiences that are rich and fast.
Patrick McKenna (GitHub)
Slides:   1-PDF 
GitHub integrates with a wide spectrum of third-party tools, giving you considerable flexibility when setting up a deployment pipeline. So where do you begin? In this tutorial, we will demonstrate how to use these integrations in practice. You will learn how to implement CI and CD incrementally, so you can start building, or iterate on, your organization's pipeline.
Jay Edwards (Meangrape, Inc.)
Slides:   1-PDF 
Distributed systems is a term that is starting to include computing in its most general case. Multi-threaded, multi-core, NUMA systems have a lot in common with multi-datacenter, multi-server systems. This commonality is what makes the study of distributed systems a valuable lingua franca. During this seminar you will learn about the various fundamental underlying principles of distributed design.
Slides:   1-PDF 
In this one-day introductory training, you will identify actionable strategies and tools that can be used to leverage DevOps to implement noticeable, long-lasting improvements to your culture and development processes in your environment, regardless of your level and role.
Steve Hoffman (Orbitz Worldwide), Rick Fast (Orbitz Worldwide)
Slides:   1-ZIP 
In this talk we will discuss how we enabled decomposition of one of our 250+ system components into a continuously deployed microservice cluster using Docker, Consul, the ELK stack, and Graphite. We will discuss the architecture and supporting services as well as the continuous delivery from source to production via Ansible and Jenkins.
Astrid Atkinson (Google)
Slides:   1-PPTX 
As distributed systems grow, they differentiate - growing dependencies, subsystems, and support infrastructure. How do you keep complexity in check without stifling innovation?
Bruce Lawson (Opera ASA)
Slides:   external link
The next billion people online will overwhelmingly be on mobile devices, and the vast majority will be in emerging economies where devices and networks are constrained. Although you might not know it now, many of the next billion web users will become your customers - or your competitors'. What do you need to know about this billion to ensure your sites perform well for them?
Ian Malpass (Etsy)
Slides:   1-PDF 
Failure is inevitable. Wait! Come back! It's OK. If things are going to go wrong, what do you do? I'll tell you about Etsy's approach to failure: how it influences our tools and our philosophy, and how we try to minimise the cost of failure.
Ariel Tseitlin (Scale Venture Partners)
Slides:   1-PPTX 
People on the business side of the equation often hear about approaches like continuous deployment, and think they have risk written all over them. But the opposite turns out to be true.
Jeff Sussna (Ingineering.IT)
Slides:   1-PDF 
The complex, co-creative nature of digital services means we can’t fully know how our designs will work until we deploy them. Operations needs to be an input to design as well as an output. This talk will present continuous design as an extension of DevOps, and describe concrete ways to create a circular design-operations loop that unifies marketing, design, development, operations, and support.
Patrick Lightbody (New Relic)
Slides:   1-PPTX 
For years we've been hearing WHY software is disrupting industries. But we hear little about HOW to invest the scarcest resource in the software economy: developer talent. If you're going to compete as a software company, you gotta listen to your apps. We tell you how.
Paddy Ganti (Instart Logic), Mehrdad Reshadi (Instart Logic)
Slides:   1-PPTX 
In an effort to keep up with dynamic and responsive web trends, the amount of first and third party JavaScript usage has tripled over the last three years. The result is bloated web pages that create a drag on performance. This session will focus on a set of new solutions that make first and third party JavaScript code load faster, speeding up page load times and improving customer experience.
Wilfried Schobeiri (MediaMath)
Slides:   1-PPTX 
In the blink of an eye, MediaMath’s digital marketing systems host real-time auctions and serve ads across the world to the tune of 2.1 million QPS. And we are building the next generation of these systems on Go. In this talk, we’ll explore how to maximize Go’s performance, how to use Go to handle hundreds of thousands of requests per second, and why Go is a game-changer for building services.
Ritesh Maheshwari (LinkedIn)
Slides:   external link
In this talk, we describe how LinkedIn used RUM (real user monitoring), beyond its monitoring use case, to improve content download times of the LinkedIn site by up to 25%. We describe the techniques used by LinkedIn to drive many optimizations using RUM, and show how they can be used by others to solve similar problems.
Michael Brunton-Spall (Government Digital Service)
Slides:   1-PDF 
As we move towards architectures designed to cope with changing requirements, and eternal services that go live and iterate, how can we manage change in a secure way? How can we possibly build secure systems in this environment?
Eric Lawrence (Telerik)
Slides:   1-PPT    external link
Securing your websites and services using HTTPS has never been more important, or more complicated. Learn the best practices for using HTTPS today, from ciphers and hash algorithms to new browser features like HSTS and PKP. Explore how attackers circumvent HTTPS, and what you can do to help protect your visitors.
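As a minimal sketch of one feature named in the abstract above, the following shows how an HSTS policy is expressed as a `Strict-Transport-Security` response header value. The helper `buildHstsHeader` and its option names are hypothetical and do not come from the talk; only the header name and its `max-age`, `includeSubDomains`, and `preload` directives are standard.

```typescript
// Hypothetical helper (not from the talk): builds the value of a
// Strict-Transport-Security response header from a few options.
interface HstsOptions {
  maxAgeSeconds: number;       // how long browsers should remember the policy
  includeSubDomains?: boolean; // also apply the policy to all subdomains
  preload?: boolean;           // signal consent to browser preload lists
}

function buildHstsHeader(opts: HstsOptions): string {
  // Reject negative or fractional durations.
  if (opts.maxAgeSeconds < 0 || Math.floor(opts.maxAgeSeconds) !== opts.maxAgeSeconds) {
    throw new RangeError("maxAgeSeconds must be a non-negative integer");
  }
  const parts: string[] = ["max-age=" + opts.maxAgeSeconds];
  if (opts.includeSubDomains) parts.push("includeSubDomains");
  if (opts.preload) parts.push("preload");
  return parts.join("; ");
}

// Example: a one-year policy that also covers subdomains.
const oneYearSeconds = 365 * 24 * 60 * 60;
const hstsValue = buildHstsHeader({ maxAgeSeconds: oneYearSeconds, includeSubDomains: true });
console.log("Strict-Transport-Security: " + hstsValue);
```

The header only takes effect when it is served over HTTPS, so a site would typically attach it to every HTTPS response and start with a short `max-age` while testing.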
Karl Matthias (New Relic), Sean Kane (New Relic)
Slides:   1-PDF 
At New Relic we’ve delivered production apps via Docker containers for over a year, to significantly streamline app delivery workflow from development through production. We’ll explain choices we made to implement a production Docker workflow, and demo tools that allow you to build Docker clusters in multiple datacenters and monitor your containers across those environments.
Norm Warren (Ancestry Inc), Jeremy Johnson (Ancestry.com)
Slides:   1-PDF 
Influencing a developer-centric company to care more about site performance is not a small task. We will walk through successful approaches to performance monitoring at Ancestry.com. These include collecting internal SLA statistics and implementing a grading system that focuses on the top 30 pages while correlating with business data.
Dana Quinn (Intuit)
Slides:   1-PPTX 
Companies are turning to the public cloud for all the ways it makes life easier for developers. The business side of the house likes it too, since it shortens the time from idea to implementation. But the shift to any new technology must be handled with care. Learn hard-earned insights from Intuit’s gradual migration to the public cloud.
Benny Wong (Timehop)
Slides:   1-PDF 
You've read everything on Hacker News. You've deployed a bunch of apps on Heroku. You've signed up for as many stats and APM services as you know of. You have everything you need to scale if your startup blows up, right? In this talk, we'll walk through some of the lessons we've learned the hard way growing 13x over the past 9 months.
Brendan Gregg (Netflix)
Slides:   1-PDF 
There are many performance tools nowadays for Linux, but how do they all fit together, and when do we use them? This tutorial explains methodologies for using these tools, and provides a tour of four tool types: observability, benchmarking, tuning, and static tuning. This includes perf_events, ftrace, and other tracing tools that use the tracepoints, kprobes, and uprobes frameworks.
Robert Peters (Verizon Digital Media Services)
Slides:   1-PDF 
Building and operating a global delivery network is all about providing high standards of performance and reliability. Rob will share recent experiences about how Verizon Digital Media Services lives up to those standards with a combination of technology, tools, and culture.
Mehdi Daoudi (Catchpoint)
Slides:   1-PPTX 
DevOps is so much more than just a cost center – it’s a valuable initiative that, if executed properly, can greatly increase a business’s bottom line. Join Mehdi Daoudi, CEO and co-founder of Catchpoint Systems, to learn how you can enhance your IT and DevOps departments, thus maximizing efficiency and saving the most valuable resource of all: TIME.
Buddy Brewer (SOASTA)
Slides:   1-PDF 
In this talk, Buddy will discuss ways the Velocity community can bring different sets of data together to lead these vastly different groups through a shared conversation about performance.
Tammy Everts (SOASTA), Cliff Crocker (SOASTA)
Slides:   external link
There’s no one-size-fits-all web performance metric. In this session, we’ll walk through various metrics that answer performance questions from multiple perspectives--from designer and DevOps to CRO and CEO. You’ll walk away with a better understanding of your options, as well as a clear understanding of how to choose the right metric for the right audience.
Michael Brunton-Spall (Government Digital Service)
Slides:   1-PDF 
Odds are, recently you've heard something along the lines that the one true architecture for any new system is a microservice-based one. The problem is, I can't be sure that your view of microservices is the same as mine. I can't point at any architecture and confidently declare that it is monolithic or microservices. The underlying issue is that the terms we use are dangerously undefined. 
Tim Kadlec (Akamai)
Slides:   external link
Loading images isn't as simple as you think. We often discuss image weight and download time, but mobile devices deal with many other challenges, such as decode times in underpowered CPUs, limited CPU/GPU transfers, and memory consumption. In this talk we'll explain those limitations, their impact, and how you can best handle them on your own site.
Jessica DeVita (Microsoft), Jennelle Crothers (Microsoft)
Slides:   1-PPTX 
Microsoft is committed to making software that delights developers and IT pros alike, enabling our customers to be productive anywhere on any device. Jessica and Jennelle will show you how Microsoft, a force in the market for four decades, continues to innovate and reinvent itself on a massive scale.
Mandi Walls (Chef)
Slides:   1-ZIP 
This follow up to my 2012 Velocity talk, “Challenges to Cultural Change,” will examine a number of common themes presenting in technology organizations of varying size. These pathologies exhibit strengths and weaknesses around tasks, behaviors, and treatment of personnel that affect the day-to-day running, and long-term success, of IT projects.
Mohammed Aboul-Magd (Akamai Technologies)
Slides:   1-PPTX 
Delivering rich images fast can be a difficult task. Join this keynote session featuring Akamai’s Mohammed Aboul-Magd to better understand how to deal with the ever-increasing number of images on the web. You’ll learn how to reduce cost and complexity while delivering a consistent and high quality viewing experience to every user, on any device, anywhere.
John Feminella (UpHex)
Slides:   1-PDF 
Knowing how to measure your technology and business operations is crucial for optimizing them, but it's nowhere near as important as knowing what metrics to look at in the first place. In this talk, we'll discuss how to pick metrics and associated visualizations that maximize your chances of making the right call for your business.
Gopal Brugalette (Nordstrom)
Slides:   1-PPTX 
Successful preparation for major traffic events requires a close partnership between Business, Product Management and Technology. In this talk we will explore how the different organizations across Nordstrom work together to prepare our site for its two major online events each year.
Jason Ding (Salesforce.com)
Jason will discuss how his cross-functional team proactively addresses scale limitations in our cloud stack. They work with field solution teams, integration partners, and the customer to understand their data model, customizations, and peak traffic patterns for both interactive users and batch processing. They implemented a simulated workload to identify software and hardware limitations in SFDC’s SaaS stack.
Manish Vachharajani (F5 Networks)
Slides:   1-PPTX 
Your network should be working for you. If it’s not, then something’s missing from your toolbox, and you’re probably writing more code than you need to make up for it. Taking inspiration from past Velocity talks, this session will demonstrate how network components like load balancers and application proxies can accelerate your application and your development process.
Laine Campbell (Pythian)
Slides:   1-PDF 
As technology jobs become increasingly hard to fill, the average starting salary for an engineer in the Valley is more than the median family income in the US in many demographics. Laine will discuss how to build your organization to embrace a culture and process that drive diversity in recruiting, hiring, and retention.
Mikey Dickerson (Federal Government | United States Digital Services Team)
The U.S. Digital Service was launched in August 2014 as a crazy gamble to see if the strategy that succeeded at pulling Healthcare.gov out of the fire could be applied to other government problems. Now at nine months old and about 130 people, the U.S. Digital Service has taken on immigration, education, Social Security, cyber-security, and Veterans Affairs.
Peter Blum (Instart Logic), Jesse Maula (Idean)
Slides:   1-PDF 
Responsive web design offers a solution to deliver the right web experience to every device. But its use can result in poor page load performance, which is made worse by bottlenecked wireless networks. Peter and Jesse will discuss some of the challenges around responsive web design, and solutions to dramatically improve page load times when using RWD.
John Yi (Rackspace)
Slides:   1-PDF 
Running Docker containers on physical servers is the ideal scenario. However, cloud users are running their containers on VMs, adding back the overhead. OnMetal provides a way to run containers on physical hardware. We'll discuss running Docker on OnMetal and some of the different ideas for leveraging OnMetal.
Rajiv Kurian (SignalFx)
Slides:   external link
At SignalFx, we deal with high-volume, high-resolution data from our users. This requires a high performance ingest pipeline. Over time we’ve found that we needed to adapt architectural principles from specialized fields such as HPC to get beyond performance plateaus encountered with more generic approaches.
Laura Bell (SafeStack Limited)
Slides:   1-PPTX 
Sometimes the best way to build a resilient application is to repeatedly break it. What if building secure applications needs the same sort of destructive play? This talk explains why, to protect organizations and applications from modern security threats, we don’t just need to defend our applications--we need to destroy them first.
Patrick Meenan (Google)
Slides:   1-PDF 
Service workers provide a fully programmable in-browser proxy which presents a fairly open slate for doing all kinds of things. Here we will explore practical applications (with real examples) for preventing third-party SPOFs, improving performance, and getting more control over your user's experience interacting with your site. All with effectively no changes to the pages themselves.
Ariya Hidayat (Shape Security)
Slides:   external link
Writing fast JavaScript code is not only about microbenchmarks. Understanding the JavaScript engine will reveal the typical code patterns favored for maximum execution speed (short function, fixed object shape, profile-guided, garbage minimization). It is also important to consider other strategies such as sampling and tracing.
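Two of the patterns listed in the abstract above, fixed object shape and garbage minimization, can be illustrated with a small sketch. This is not taken from the talk: the `Point` type and the helpers `makePoint` and `sumInto` are hypothetical, and the performance benefit is an assumption that depends on the engine, so any real gain should be measured rather than taken for granted.

```typescript
// Hypothetical illustration (not from the talk).
interface Point {
  x: number;
  y: number;
}

// Fixed object shape: every Point is created with the same fields in the
// same order, and no properties are added or deleted afterwards.
function makePoint(x: number, y: number): Point {
  return { x: x, y: y };
}

// Garbage minimization: accumulate into a caller-provided object instead of
// allocating a new intermediate Point on each loop iteration.
function sumInto(points: Point[], out: Point): Point {
  out.x = 0;
  out.y = 0;
  for (let i = 0; i < points.length; i++) {
    out.x += points[i].x;
    out.y += points[i].y;
  }
  return out;
}

const samplePoints: Point[] = [makePoint(1, 2), makePoint(3, 4), makePoint(5, 6)];
const total = sumInto(samplePoints, makePoint(0, 0));
console.log(total.x, total.y);
```

Both helpers are also short, single-purpose functions, which matches the "short function" item in the same list.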
Siddharth Ram (Intuit)
Slides:   external link
Track: Deliberately unstable systems. About: This presentation will cover patterns used at Intuit that allow us to manage the characteristics of instability. Right technology patterns, right cultural changes, and right mindsets have moved a large organization from a scaled-up, shared everything to a scaled-out, shared nothing architecture.
Aaron Suggs (Kickstarter)
Slides:   1-PDF 
Discover the benefits of training your entire organization to contribute code. Kickstarter teaches GitHub for Poets, a one-hour class that empowers all staff to make improvements to our site, and fosters a culture of transparency and inclusivity. Learn about how we’ve made developing with GitHub fun and safe for everyone, and the surprising benefits of having more contributors to our code.
Baron Schwartz (VividCortex)
Slides:   1-PDF 
This is a story of how systems taught me about people, and people taught me about systems. Once you see it, you can't unsee it: systems and teams both do work, and have the same types of bottlenecks. The real breakthrough is figuring out what to do about it, and that's where systems and people are very different. I'll share in detail.
Zach Leatherman (Filament Group)
Slides:   external link
The default loading behavior for web fonts, making the text invisible until the font files have successfully loaded, is harmful to a performance-focused website. In this talk we’ll learn a variety of methods to progressively render web fonts and show a fallback while the fonts are loading. We’ll also show techniques to group repaint of web fonts from fallback text to improve usability.
Sam Newman (ThoughtWorks)
Slides:   external link
Microservices seem to be all the rage nowadays, but I wanted to understand what makes them tick. This talk, based on my own practical experience and research for the _Building Microservices_ book, presents the core principles and practices that are key to doing microservices well.
Mark Maun (Ticketmaster)
Slides:   external link
Mark will talk about his journey at Ticketmaster, where he rolled out tools and automation, and not only made the Ticketmaster software factory more efficient but also invoked cultural change in the workplace. Rather than mandate cultural change by executive fiat, we did it organically from the ground up. Tools and technology, not executive memos, were the origin of this transformation.
Ariya Hidayat (Shape Security)
Slides:   external link
What actually happens in the complicated network of undersea fiber optic cables, the foundation of modern communications? The talk gives an entertaining journey through this remarkable achievement, a series of scientific milestones in recent human history that will tickle the geeky side of the audience and inspire further exploration.
Sam Guckenheimer (Microsoft)
Slides:   1-PDF 
This is the story of Visual Studio’s journey to cloud cadence and DevOps. Sam will talk about our cultural transformation and share details on the mistakes we made along the way, the lessons we learned as well as our successes as we transitioned from shipping every three years to every three weeks.
Andy Davies (NCC Group), Simon Hearne (NCC Group)
Slides:   1-ZIP    external link
We increasingly rely on third-party scripts to deliver business-critical features of our sites, but do we understand the impact they're having on our site's performance and our business? In this session we’ll share the insights we’ve gained from analysing the impact third-party scripts have on the performance of our customers' sites, the value they deliver, and how our clients manage them.
Kurt Andersen (LinkedIn)
Slides:   external link
The email world has been changing over the last 10 years to thwart its exploitation by abusers and spammers. Domain authentication is the biggest step in this evolution. This tutorial will help you know what to require of vendors or to implement in your own systems to ensure that you can effectively deliver your message to your end users--reliably, safely, and while protecting your brand identity.
Mike Demmer (Jut), Chris Christensen (Limelight)
Slides:   1-PDF 
Analytics is the future of monitoring for DevOps teams. Learn how to think about typical monitoring tasks as systems analytics challenges, and begin to think about asking the right questions to inform your decisions. This session will show real examples of systems monitoring, unified analytics across logs and metrics, and statistical techniques that you can implement today.
Tim Sheiner (Jut)
Slides:   1-PDF 
Get a UX professional's view of the operations world... it's like a look in the mirror, except you'll walk away with tangible ways to rethink how you communicate about your challenges, your work, and the opportunities for Ops within your business.
Aneel Lakhani (SignalFx)
Slides:   1-PDF 
Because we want to win. OODA - Observe, Orient, Decide, Act - is pop-tech-devOps-unicorn-buzzword-cargo-culting canon. But the idea is more interesting than the myth. It's not about moving fast faster fastest. It's about changing the game being played. It’s about shrinking the time to do some things so you can spend more time doing other things. It’s about the time we have and where we spend it.
Ilya Grigorik (Google)
Slides:   external link
The limitations of HTTP/1.X forced us to develop various application workarounds to optimize performance, but in the process we've also introduced numerous regressions: poor caching, unnecessary downloads, delayed execution, and more. Well, good news, because HTTP/2 eliminates this tradeoff: we can eliminate the hacks, simplify our applications, and deliver improved performance!
Rob Woolley (Solium)
Slides:   external link
How can a small and highly technical team help empower a business? Especially when that business is making the move to operate globally and grow its presence in new markets? By building trust between all teams as we pragmatically embrace DevOps principles.