Transcript: Performance

00:00.0

Hello, and welcome to another episode of Django Chat, a weekly podcast on the Django web framework.

00:11.0

I'm Will Vincent, joined as always by Carlton Gibson. Hi, Carlton.

00:14.0

Hello, Will.

00:14.7

And this week, we're going to talk about performance. So performance matters because it is the probably

00:20.8

most important part of the user experience. Google punishes slow sites with SEO these

00:25.1

days and even something like amazon with e-commerce has done studies showing that just a 100 millisecond

00:31.0

slow down can cost a percentage of sales and what's 100 milliseconds that's like a blink of an

00:36.8

eye um what's the actually as we get into it what's the default these days for how was it 300

00:43.4

milliseconds a user can't tell the difference but then after that every 100 milliseconds i think

00:48.4

oh well there was i don't know but there was something about iphone um back in the day when

00:53.9

playing with those sorts of things the um there was something about the responsiveness there was

00:59.7

a 300 millisecond yeah 300 clicking in a wet in a web view and it used to drive users mad so people

01:07.6

would be building web apps no native apps but using web views and there'd be this this this

01:13.9

noticeable delay yeah yeah yeah versus a native button which just went off straight away and i

01:20.9

think the delay was 300 milliseconds and they there was some can we get rid of it no we can't

01:24.9

get rid of it because of their uh but that was like a glitch in the system yeah and it just gets

01:29.9

super frustrating i think 300 milliseconds definitely is noticeable yes yes but i don't

01:35.5

know why isn't so we're going to go through it's a whole set of tools and approaches yeah go ahead

01:41.8

well no web web framework uh no web performance metric was um they talked about less than one

01:47.6

second screen um to glass like if you could if you if your full load time to to interactive on

01:55.3

the glass was less than a second then users considered that fast and that's like the the

01:59.2

gold standard that's having your your your html delivered your javascript on there your your um

02:06.1

css in place so at least your first layout done even if you're pulling in images and things like

02:11.4

that but also your page clickable and the two glasses that it's responsive that it's it's

02:16.6

rendered and responsive in under a second and that's considered like you know that's quick

02:21.3

that's and you're talking about on mobile here right i mean on a website i say it's less than

02:26.3

i think over time people have come to expect better but that's of course no but i think

02:33.5

that's pretty good you bang that into even say it's at your desktop even with a decent connection

02:37.8

you put that into your desktop browser and you type any any site that's loading any kind of

02:43.6

javascript and like half of them will be slower than a second so anyway that that that that one

02:49.9

second to glass idea is is kind of like a benchmark um and if you think about how long it takes to

02:55.9

load javascript and for it to um well all the assets to arrive you've got network latency and

03:01.8

then you've got the loading time then you've got the rendering time it doesn't give your jango web

03:06.5

application very much time to respond right you're going to meet that right exactly and last point

03:13.3

before we get into all this i do want to mention uh donald newth is that how you say his name um

03:17.5

knuth knuth knuth yeah it says says wikipedia i always thought it was new that was going new for

03:22.4

years and then i looked it up it's knuth so knuth he's a he's a stanford you know he looks at email

03:27.5

i think like once every six months and thinks all these deep thoughts and he has these incredible

03:31.5

series on computer science anyways here's his quote on performance before we get into it which

03:36.1

is quote the real problem is that programmers have spent far too much time worrying about

03:39.7

efficiency in the wrong places and at the wrong times premature optimization is the root of all

03:44.0

evil or at least most of it in programming so you've probably heard the second half of that but

03:48.7

i think it's in context it makes a lot of sense around uh as we get into all these things

03:54.2

basically think about what you're doing don't just blindly whack every performance efficiency

03:59.2

mole that pops up because those will be infinite but also like what's it turns out as a sort of

04:07.1

matter of fact just from you know how programs behave in the wild that most of your performance

04:13.5

issues will come from a very small number of places and you won't be able to predict where

04:17.9

they are in advance so what you need to do is build it just build it simply and as sanely as

04:23.4

you can don't make optimizations yes you know just don't spend time optimizing it at all and then

04:30.6

profile it and see where the the three performance bottlenecks which are taking 70 percent of the

04:36.2

time are optimize those and with for a fraction of the effort of micro optimizing everything as

04:42.3

you went along you've got a more performant web application yes but it is tempting to try to do

04:48.6

it locally so yeah well it's interesting isn't it okay you know can i can i if i use exists versus

04:55.7

count do i get a four millisecond yeah okay well let's get into that so how do you baby step up it

05:01.2

So the very first thing I would say is you have to have Django debug toolbar,

05:05.2

just to, that's a third party package.

05:07.5

Uh, it.

05:08.7

that gives you configurable panes, configurable panels, so you can see the request response cycle

05:13.9

of a page. Basically, it shows you how many queries are there and how long it takes to load

05:18.6

locally. So this isn't a proxy for production, but it gives you a quick look at it. And the two big

05:25.6

ones, again, do this in production, the two big things you're going to want to look at for queries

05:29.6

is select-related and prefetch-related. Do you want to take a stab at those, Carlton?

05:35.7

okay so so if you so there are as well as django debug toolbar which you'd use locally there are

05:41.3

things called um application apm what does the p stand for application something monitoring i can

05:46.5

never process maybe yeah process but let's pretend it's process it might be something into it could

05:50.9

be performance um but anyway apm so um there used to be one called op beat which got bought up by

05:56.6

elastic and then there are other ones out there um what's the new relic was for a while roll bar

06:02.9

I always think Sentry should have one, but I don't think they do.

06:05.9

Yeah, that's error-checking that.

06:07.2

But I think Rollbar is Datadog, I think.

06:10.4

Right, okay.

06:10.9

But anyway, there's loads of these, and they just wrap something around your application

06:13.8

which monitors how long execution times take.

06:16.2

And what you're going to find if you've got any old normal application

06:19.5

is that your biggest hit is database.

06:22.6

Yes.

06:23.0

Is the time to fetch data from the database.

06:25.0

That's most of your response time from your Django application.

06:28.8

So what SelectRelated does,

06:32.0

is it when there's a foreign key you can say hey can we just join those two together with an sql

06:37.7

join and can we get them all in one database here rather than um two or more um you know if it's

06:44.4

right because one larger one for each database hit is each for a related object um and then there's

06:49.9

the the other so the other option is prefix related which is for many to many or many to one

06:55.3

relationships where you want to fetch um i know all the authors and all the books or i don't know

07:00.7

together look can a book happen yeah a book could have many authors so that's fine um and what that

07:05.3

will do is it'll fetch the authors and then it will fetch all the books that are related to the

07:08.9

authors and it will do that in in a couple of database queries rather than it can't do it in

07:13.6

one because it can't do the join but it will do it in two database queries rather than you know

07:20.7

perhaps potentially hundreds and i think uh the history of django so select related related was

07:25.9

always there and then i believe prefetch related was added later i'd have to go and look to be

07:30.4

honest but yeah that's these are these are the two hammers that you're going to want to use as a as

07:35.5

a first step generally speaking when you see a page that's loading slowly and django's test suite

07:41.0

has a really cool cool tool called assert none queries which if you write a unit test fetching

07:47.1

the data you want i haven't used that you can you can assert that only one query was made when you

07:51.8

select related to fetch your data and so you can kind of something carlton sometimes look at that

07:57.3

Well, I just found it in the Django test suite.

08:00.2

I'm like, what's that?

08:01.0

That's quite exciting.

08:02.4

But yeah, so you can write a unit test fetching your, you know,

08:05.5

so say you've got a convenience method which wraps all the data you need

08:10.0

for your view and returns it nicely.

08:12.0

So, you know, so you keep that logic out of the main line of the view.

08:15.6

You can test that method used with a certain number of queries and say,

08:20.8

look, I'm expecting this to make two queries because I'm using prefetch related.

08:23.9

I want one for the authors and one for the books,

08:25.6

and I don't want any more queries.

08:27.3

Um, so that when you, you know, iterate through your list in your test, it says, yeah, I did

08:33.9

fetch all of the objects here in two queries rather than one for the, for the author and

08:38.7

then one for each of the books as I traversed the relationship.

08:42.8

Right.

08:43.5

Yeah.

08:43.8

Yeah.

08:44.3

I like that.

08:45.6

Huh.

08:45.9

Okay.

08:46.1

I'm going to have to use that.

08:47.3

So what else?

08:48.0

So, yeah, so, so reduce the number of queries, right?

08:51.2

That's the first thing.

08:52.2

And then make sure your queries are efficient.

08:54.1

So that's the second thing I'd say is indexes.

08:57.0

So look at the queries you're making in your views and then make sure that those you can use explain the database explain query sets now from 2.2 have an explain method, which is kind of nice.

09:09.9

It saves you having to extract the query from the query set using query query and then putting that into your shell to explain it.

09:17.3

You can just call explain and it will give you it will send that off to the database and ask it to explain says what it's doing.

09:22.7

And you have to do that a few times and read them.

09:25.1

But it will say, look, and now I'm doing a row scan.

09:27.4

And what a row scan means is I'm going through every single row of the database table to see what the matches are.

09:33.8

And what you don't want that, that's where you want an index.

09:36.7

Because you want it to just go and look up in the index and get the matching values from the index, which is a much quicker operation.

09:43.0

So reduce the number of queries and make sure you're using index correctly.

09:46.3

That's my big one and two.

09:47.8

And then you were just about to say?

09:50.0

Caching.

09:50.8

Yes.

09:51.3

which we have a whole episode on this and we talked about that briefly before.

09:55.9

Yeah. Um, so cash all the things, uh,

10:00.0

well, yeah, but well, you know,

10:01.2

if it took a long time to get out of the database and you're getting it all the

10:04.8

time, cash it. I worked on a site which was, uh,

10:09.9

an API which was serving social media data.

10:13.5

I know as a competitor to some, one of these, um, I can't,

10:17.5

remember see so long ago but anyway it was it was social media data mining nonsense and they had

10:23.4

clients that were um making lots and lots and lots of api requests all the time and every every

10:29.6

request they had to um check the api key so you don't want to go fetching all the api keys from

10:37.1

the database every single request just to check whether the api matched so we would fetch it once

10:42.5

an hour or whatever and we would then check against the cache where the api key was correct

10:49.0

rather than against the database because well that's quicker right but the key thing is you're

10:55.2

doing this on real live production data because again i'll say this again to folks don't waste

11:00.5

time doing this locally it's so tempting to do but you you need yeah but you need um the code

11:05.6

path has to be the same right so that django gives you a dummy cache backend which is great for local

11:10.6

development because it it's it's it exposes the cache but it's just it doesn't work it doesn't

11:15.7

do anything so you can say is this in the cache no it's not because the dummy cache never caches

11:20.6

anything and then you can go and hit the database so in in development even though you need you just

11:25.4

use a dummy cache back end it's a bit like using the console email back end you know right right

11:30.3

yeah yeah uh indexes i want to quickly note so well one thing which i think is cool is that

11:36.0

starting with 1.11 you can do this in a meta class on your models instead of adding a db index field

11:43.1

i personally find that doing it through meta is a little more readable and i can put put more

11:47.7

things in there um but what would be so indexes are abused like what's the downside of just

11:53.4

indexing everything right what time and space so it takes so when you when you if you've got an

11:59.2

indexed field yeah yeah index field it will column in the database table it will um take longer to

12:07.9

write that record to the database if it has to index it at the same time so that's time so right

12:15.1

performance is impaired if you've got an index in place but also space because um you know it's like

12:23.1

a phone book right so the classic example of an index is a phone book where i you know i want to

12:27.0

someone's phone book phone number up by their name okay so instead of going through the list

12:33.0

one by one i can just go to the alphabet look at get to the right place in the alphabet and get the

12:37.6

number um but phone books are big and fat right they take up a lot of space on the coffee table

12:44.2

under in the hallway under the telephone so that's the same same problem for every index that you add

12:48.7

right they're not costless yeah they're not costless but to be honest on balance are you

12:55.4

making actual queries you on this data if so you probably want an index in play but until you've

13:02.4

started and once you're building before you build your application you probably haven't designed it

13:06.0

well enough to know which columns you'll actually be querying right i think that's the key thing is

13:11.0

your schema can and will change and especially once you start indexing schema 1.0 it's that's

13:17.7

the wrong approach i've done that i don't sorry i don't quite like if you you start you start with

13:22.9

your basic schema yeah you go hog wild with indexes and then you find out that i want to

13:27.5

change the end that change the schema around because my data is different or i add new features

13:31.4

but then you've got the indexes and it's too much too much it's premature optimization

13:36.9

yeah i mean and this this is um you know this happens um in uh no sql land quite a lot because

13:44.8

for instance couch tv you have to you you have to create these views which are essentially indexes

13:50.4

and you have to specify them up front and so you think oh i'm gonna here's an index i'll create

13:55.4

this or here's a view i'll create this um and it goes and processes them all and then you realize

13:59.2

that was totally wrong and this happens with elastic search as well because you think i'm

14:02.7

going to search on this so you create an index you know searching with these fields and then

14:06.4

you have to re-index it all you realize that isn't right it's quite difficult um but well this is the

14:11.8

thing with no sequel it's it needs to be used appropriately because when you first start using

14:16.5

it you're like this is amazing like i don't need to worry about anything yeah yeah but uh so anyway

14:22.7

use django debug toolbar when you've got your application running you're like i'm going to

14:26.6

deploy this okay so go through locally go through see what the actual queries are use the explain to

14:32.4

um are these queries sensible put an index in is is it improved and whilst local isn't a proxy for

14:41.2

production it kind of is in in that it won't tell you the exact numbers but it will tell you the

14:46.2

relative scale yeah i mean look at it for sure yeah uh and we mentioned you know the other if

14:52.7

there's a fourth big area i would i would say be the front end assets which is a django developer

14:56.8

you probably don't have as much control of but you can use tools for example you can use django

15:01.2

compressor third party super package um carlton maintains well you want to use a cdn what you

15:07.0

help with um you can use a cdn uh you know i there's a whole actually a link to it um i don't

15:13.1

know how to pronounce it, Adi Osmani, who's at Google, has a whole free web book on images,

15:20.2

which is really fascinating.

15:21.2

I mean, for example, if you haven't thought about it, you can use Easy Thumbnails, which

15:24.9

is a package.

15:26.2

So that rather than showing the two megabyte version of like, let's say it's a photo, a user profile photo, and someone can upload a photo. But when you show it on the screen, it's a tiny little thing where you can have a thumbnail version of it, the full version of it. These are sort of basic steps that are really performant. So just front end assets in general, and especially if you look at Google PageSpeed, there's other ones, all the major browsers have to evaluate site speed, they will help you especially with the front end assets to see like your JavaScript is way too big.

15:56.2

Yeah, or is your web server configured to cache these, to send the right caching headers, to say, look, you know, this CSS file, cache it indefinitely.

16:05.3

And one thing that Django Compressor will give you is a nice concatenated file, but it has a silly hash in the file name.

16:14.2

So you can cache that forever, because if you change your CSS, that hash is different, and so the file name is different.

16:21.2

And so you can configure your web server to tell the browsers and to tell the proxy caches

16:27.2

out there, caches forever.

16:28.9

Oh, it's fun to do performance stuff.

16:30.6

It's just never ending.

16:32.9

The last thing I would say, and you can say what you like, Carlton, is I find that the

16:37.8

Django extensions third party package is very helpful because it's got a whole bunch of

16:43.5

things, but specifically with Shell Plus, which will auto load models into your shell

16:47.4

when you need to drop in.

16:48.6

It also has Run Server Plus.

16:49.8

it's sort of a swiss army knife of tools i find myself using it all the time because whenever i

16:55.4

go into the shell i want the models loaded and it's just i can't live without django extensions

17:01.0

yeah i'm a big fan of it it's um it's got this um i can't remember the exact command but it's

17:06.7

got this ability to output a picture a dot a dot file which is a graphics format um of your models

17:14.7

and you can drag that into you can either view that in in what's the command line program i

17:20.5

can't remember but you i drag it into omni-graphle and then you've got a nice diagram of all your

17:25.0

models and the relationships between them and you're like hey that you know i love that oh

17:29.2

yeah well i mean the hard thing with all these favorite packages and tips is just figuring out

17:36.1

the the priority and the curation of them uh this is why for example like uh so uh awesome

17:42.2

django is a repo i maintain there's a whole bunch of third-party packages and i'm um i appreciate

17:47.4

lots of prs and issues people put in there but i don't want it to be a thousand packages long

17:52.4

i'm trying to keep it curated but when carlton i mentioned django debug toolbar django extensions

17:58.3

i would say almost every django site should use those yeah i don't have a problem saying that

18:03.6

it's amongst the packages that i'd pip install without you know really having any concerns

18:08.1

yeah that actually would be a cool thing to like what's your top you know your top five top ten

18:12.7

third party must-haves if we uh surveyed some talking heads and that might be a fun thing to do

18:19.7

yeah i should do that um any last things on performance we've we've really hit the kind

18:25.4

of the high points but jango probably is probably isn't your um problem as long as you're not making

18:31.3

200 database requests on a single thing you know if you do the basics it's jango probably isn't

18:37.2

your bottleneck for you know most web apps it's your javascript and your front-end stuff that's

18:42.9

that will have more of a an effect but you know if you're pushing if you're pushing it to you know

18:49.6

to its you know if your server is doing something intensive is your january application that's

18:54.1

driving it then you select related prefetch related in indexing then caching things like

19:00.6

serialization can can cost time you know rest framework if you're using rest framework

19:06.0

if you're really pushing that to limit serialization it's like template rendering

19:10.8

it's an expensive process so there are alternate serializing options which you might go for if you

19:16.8

were really you know driving it make sure your middlewares are optimized um you know the list

19:22.3

but what did you know premature optimization is the root of all evil chances are you will get the

19:29.7

throughput you need doing the the two or three things which are eating all the time rather than

19:34.9

you know worrying about oh should i should i spend a week changing my serializer layout to

19:40.9

yeah i was gonna say i mean 20 of the effort will get 80 of the way there um but i feel like i've

19:47.3

been saying this a lot recently but it is so tempting to dive into these small little micro

19:52.0

changes that will have an impact and ignore talking to users you know changing more important

19:59.3

things around but it um feels personally it feels better to endlessly optimize performance so i have

20:06.9

to watch myself with that well it's like the equivalent i mean you know some people their

20:10.7

busy work is answering email right for some yeah i'm doing a little i'm doing a performance

20:15.9

optimization look i've got five percent more throughput but yeah this thing is called once

20:19.9

by a back-end worker right totally pointless anyway okay that's the high points as ever we

20:25.5

are at jango chat.com uh or chat jango on twitter for the dyslexic folks out there

20:31.0

and we'll see you all next week bye-bye all right take care bye-bye