Caching

00:00.0

Hello, and welcome to another episode of Django Chat, a weekly podcast on the Django Web Framework.

00:10.4

I'm Will Vincent, joined as always by Carlton Gibson. Hello, Carlton.

00:13.3

Hello, Will.

00:14.3

And this week, we're going to talk about caching, which is a power tool of all developers, but

00:19.1

may not be familiar to folks newer in their career. And Django has some fantastic built-in

00:23.6

support. So we're going to get into all things caching. So Carlton, what is caching? Why

00:28.2

is it important?

00:28.8

so oh good caching is good if you you know say you've got some database query that takes quite

00:34.4

a long time and you're doing it all the time maybe you want to cache that and that just means keep it

00:38.8

hanging around so you don't have to make it again and ideally your cache is quick at fetching so if you

00:44.8

store a result that you got from the database you can put it in your cache and ideally it's quicker

00:49.2

to get it back from the cache than it is to go to the database that's the idea so performance right
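
A minimal sketch of that idea in plain Python, not Django's own cache framework: the result of a slow lookup is kept in memory and reused until it expires. The names `TTLCache` and `slow_query`, and the 300-second lifetime, are assumptions made for illustration.

```python
import time


class TTLCache:
    """Tiny in-memory cache whose entries expire after `ttl` seconds."""

    def __init__(self, ttl, clock=time.monotonic):
        self.ttl = ttl
        self.clock = clock
        self._store = {}  # key -> (expires_at, value)

    def get_or_compute(self, key, compute):
        now = self.clock()
        entry = self._store.get(key)
        if entry is not None and entry[0] > now:
            return entry[1]  # cache hit: skip the slow work
        value = compute()  # cache miss: do the slow work once
        self._store[key] = (now + self.ttl, value)
        return value


calls = []


def slow_query():
    # Hypothetical stand-in for a database query that takes a long time.
    calls.append(1)
    return ["post 1", "post 2"]


cache = TTLCache(ttl=300)
first = cache.get_or_compute("recent_posts", slow_query)
second = cache.get_or_compute("recent_posts", slow_query)
```

The second call is answered from memory, so `slow_query` only runs once until the entry expires.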

00:54.1

yeah and that general idea that rather than spinning up the physical disk if you load it

00:58.2

into memory like the ram it's going to be faster that's the general idea yeah i mean sometimes you

01:03.1

might cache on the disk though for instance um yeah if there's a, i don't know, you're rendering a

01:08.5

complicated template and you've you've taken you know you've gone to the database you've got some

01:13.3

data you've rendered it into a template what you've got then is some html out of

01:17.2

that which was computationally expensive there's no reason why you couldn't cache that on the

01:21.4

disk because then all you've got to do is fetch it from the disk and serve it straight away rather

01:25.0

than do all that heavy computation before but normally normally we're caching right yeah and

01:29.6

we'll get into where where you can put the cache but that's the basic idea is you you pre-process

01:33.7

things so they can be loaded faster um and i mean i think something that's maybe a little confusing

01:40.0

is it's such a broad idea caching i mean so if you index a database if folks have heard of that

01:45.1

that's basically a cache but then you can also in django you can well for example django has a

01:50.3

built in caching system, where we'll get to where you store it. But there's four options to give

01:56.6

folks a sense of how to think about this, where you can do per site, so you can just cache

02:00.2

everything. So if you have a Django site, but it's basically a static site, if it's a blog,

02:04.8

that's not changing, you can just do a, I think it's basically one line, we'll link to this in

02:08.8

the docs, and just per site, cache everything. And so after the first time, the first time it loads,

02:14.7

someone will come in, hit the site, it actually needs to process. But then after that, everything

02:18.3

is just served from memory. And actually, on that topic, let's talk about refreshing the cache, hot

02:26.8

and cold, because I think that's an important thing. Because there's this idea that, you know,

02:30.3

the cache has to be run, it has to be hot. So for example, if I've changed, I have a blog, simple

02:35.8

blog site, I add a new blog post, the first person who comes and hits that site is not going to

02:43.2

be cached even if i have caching on so either i just say well the first person who

02:48.6

comes in is going to take that performance hit or you can um heat you can what's the term you can

02:54.2

run the cache in advance i never know what the terms for these things are you can preheat the

02:58.4

cache right so what you can do yeah it's something when you publish your own blog post you can go and

03:03.2

check that it appeared on the site and in so doing it will load and then it will be cached in your

03:07.5

page so normally you cache per page right like django's got this nice page caching option

03:13.0

where you like each individual page has a so right so yeah so the four options per site per view uh

03:18.9

template which i guess is what you mean by per page i think i can never remember then you can

03:22.8

get into the low level cache api so this is something you want to play with in reality so

03:28.5

in production like you can make predictions all day long on local usage but you really just want

03:34.2

to see how your site actually performs but i would say with god we're just butchering this heating up

03:38.5

the cache warming up the cache i played around with this a ton and on a big site it's worth it maybe

03:43.9

but it's also fine to just say you know if i have thousands of visitors the first one on this page

03:48.8

when i do a change it can be a little slower for them and they'll live yeah like the pay the pay i

03:53.8

mean what's good about the cache and right so the pay however long it takes to go and get your blog

03:56.9

post out of your database to render it into the template put it on the page well okay the first

04:00.5

time that's a bit slow and then the index page needs to change as well because that you know

04:04.8

the one where it lists the first five blog posts the most recent five blog posts you need to update

04:10.7

that so questions about invalidation that we can come back to in a minute but so that first the

04:16.3

first person who loads that is a bit slow if that's you brilliant right well i i did that i i

04:21.6

manually again this is early days at a startup i would manually go through and uh reheat

04:28.7

god uh these pages um but you know in practice when you're dealing with hundreds thousands of

04:34.0

users it all evens out so would you set this to cache forever no i don't think you want

04:40.9

to do that i mean you could if you're updating all the time i recall setting it for a very long

04:45.5

period of time though off the top of my head i can't remember what a long period of time is i

04:49.2

think maybe it was a month oh right well that is quite a long time i mean i've the thing is and

04:54.0

I mean, usually it's like a week or a day.

04:55.9

Yeah, or a day or even an hour, right?

04:57.8

Because let's say you've got one person coming

05:00.9

and hitting your Django application once an hour.

05:04.0

It's really not going to kill your Django application, right?

05:07.4

But if you've got 20,000 people all at once, that will kill it.

05:10.4

So if you can cache that blog post even for an hour,

05:13.6

it means that the Django app is only really doing the hard work

05:16.5

once every hour or once every day or once every week.
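
For reference, Django's per-site cache is switched on in settings. The fragment below is a sketch rather than a complete settings file: it assumes the local-memory backend and a one-hour timeout, the `blog` key prefix is a made-up value, and the rest of `MIDDLEWARE` is left out.

```python
# settings.py (fragment)

CACHES = {
    "default": {
        # Assumption: in-process memory cache, handy for trying things locally.
        "BACKEND": "django.core.cache.backends.locmem.LocMemCache",
    }
}

MIDDLEWARE = [
    "django.middleware.cache.UpdateCacheMiddleware",     # must be listed first
    "django.middleware.common.CommonMiddleware",
    # ...the project's other middleware would sit here...
    "django.middleware.cache.FetchFromCacheMiddleware",  # must be listed last
]

CACHE_MIDDLEWARE_ALIAS = "default"     # which entry in CACHES to use
CACHE_MIDDLEWARE_SECONDS = 60 * 60     # cache each page for one hour
CACHE_MIDDLEWARE_KEY_PREFIX = "blog"   # hypothetical prefix for this site
```

With a one-hour timeout, each page is rendered by Django at most about once an hour per cache, however many visitors arrive in between.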

05:19.2

Yeah, and again, I'm thinking of this was very early stage

05:23.1

with a startup. Yeah, I think so. So there's a

05:26.2

timeout so there's arguments you can pass into the cache uh caching framework built into django

05:30.5

and i mean the docs give an example of um 300 seconds so five minutes as

05:35.5

you know substantial period of time yeah i i think that what i said was way too long but whatever

05:42.4

play around with it you know this is why you want logging and other information too on your site so

05:48.5

you can see actually how fast the page is loading um you know it's a balancing act basically but in

05:54.3

general cache everything yeah i mean before we go on and talk about the details of django caching

05:58.7

there's another layer to think about which is could you get nginx or whatever front-end proxy

06:03.0

you've got to do it instead because nginx will serve files off the file system you know far

06:09.7

more efficiently than any application you can ever write and so if you've got a blog post which

06:15.0

perhaps you update i don't know never why not tell nginx to cache it on the file system and

06:21.4

And then, from NGINX's perspective,

06:23.4

it's just like it's serving a static site

06:24.9

and it's like not even talking to your backend.

06:27.4

And that's really quite easy to configure.

06:29.3

You give it a path and you say, look, file cache,

06:32.1

and you give it the module,

06:34.0

the amount of time you want to cache it for,

06:35.7

and it will just do it.

06:37.5

So that's worth considering.

06:38.7

What do you make?

06:39.7

I remember using Varnish,

06:42.4

which is a proxy cache layer

06:44.7

when I was doing this all on DigitalOcean.

06:47.2

What's your take on NGINX versus Varnish?

06:50.0

I mean, you could use both, right?

06:51.5

Because they do different things.

06:52.2

People do.

06:52.7

So for me, for your...

06:54.1

I mean, I remember Varnish was like a huge speed boost.

06:56.6

Maybe the biggest of all the things I did.

06:57.9

Right.

06:58.1

But Varnish is a dedicated extra layer that you can use and super powerful.

07:04.4

But I always say, don't go to these things until you need it, right?

07:07.7

So what's your base set?

07:09.6

I did not do that, of course.

07:10.3

Right.

07:10.4

But what's your base setup?

07:11.5

Your base setup is, you know, just for example, I mean, you might be using Apache or whatever,

07:15.3

but let's just go with one example.

07:17.2

You're using Nginx, Gunicorn, with Django.

07:20.0

okay you've already got nginx in play and it's it's got a first-rate um caching module that's

07:28.1

really easy to configure you can use that and that will that will really will cope with you

07:34.6

know probably 90% of sites out there that's perfectly good enough and then if you are

07:39.6

really pushing it to the limit then you're going to investigate whether or not you need another

07:43.0

dedicated caching layer on top and i would say this is the type of stuff that is it's really

07:47.8

fun to do because you can at the end of the day you can say oh i increased my you know or decrease

07:52.8

my load time by x amount it sort of scratches that developer itch but it is i'm certainly guilty of

07:58.1

spending way too long getting that last five ten percent when it was totally unwarranted so it um

08:04.2

i would say be aware that this is fun and feels binary um and so a lot of times you'll you know

08:10.0

neglect things like talking to users that is a little more gray you know marketing or any of

08:14.7

that stuff marketing yeah all these things design um okay so where do you put the cache so let's

08:19.9

talk about so historically so memcache was the the first big popular caching layer though these days

08:27.4

i think redis almost everyone would say redis if you're starting from scratch you would use

08:31.6

people like redis it's got some fancy redis a little bit simpler or no it's not but it's a

08:37.5

little bit faster for sure is it is that the case i i believe so i there i've seen we'll link there's

08:42.4

detailed analysis i believe in most cases it's actually faster okay i mean look back in the day

08:46.9

and so we're talking um you know early 2000s where memcache was the option you'd run memcache

08:52.6

you'd be there even into and it was this amazing idea right because it came out of

08:56.8

i'm sorry to interrupt but yeah i remember like it came out of like livejournal or something in 2005

09:01.7

like it was i don't i think it was like the first major uh caching of that type yeah and it just did

09:08.8

the job and it did it very well and massive adoption because of that um and still brilliant

09:15.9

right it still works and no reason not to use memcache unless you're already thinking about

09:21.2

using redis and again it's like how many components do you want in your stack so if you've got redis

09:27.0

running why not use redis as a cache back end right so i guess yeah the general thing um memcache is

09:33.9

a little bit simpler but if you if you need if you're going to need redis things anyways you

09:36.9

might as well just use redis for all of it well so let's talk about those things why would you

09:42.5

use redis so i mean basically for any queue like tasks so emails one example um what are some other

09:50.1

examples that come to mind of when you would so so i guess we're i'm confusing two things here so

09:54.4

there's caching and then you'd also use something like redis for queue based yeah so why would you

09:59.2

have redis yeah because you want to you want to use a queue so let's take a good queue package so

10:03.2

you know everyone always talks about celery but celery's overkill for you know the majority of

10:08.3

use cases so what's a good package well there's one called django q which i love and have fun

10:12.7

with that's nice and simple and that's got a redis back end so you pip install or you know

10:18.3

apt install redis and then you pip install django q into your project you know a little

10:22.7

bit of settings magic and you're up and running what do you put in there anything that you want

10:26.6

to put out of band you know you're rendering a pdf you're something that's going to be process

10:31.6

intensive, sending an email, any of these tasks that we talk about all the time. And then you've

10:37.3

already, at that point, got Redis in play. So you might as well use it as your Django cache back-end.

10:43.6

For which you'll need a package. There's a couple of options, right?
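
As a rough illustration of the "settings magic" involved, here is what pointing Django's cache at Redis can look like with the django-redis package. This is a sketch under assumptions: Redis is taken to be running on localhost, and database number 1 is an arbitrary choice.

```python
# settings.py (fragment), assuming `pip install django-redis`

CACHES = {
    "default": {
        "BACKEND": "django_redis.cache.RedisCache",
        # Assumption: a local Redis server, using database 1 for the cache.
        "LOCATION": "redis://127.0.0.1:6379/1",
        "OPTIONS": {
            "CLIENT_CLASS": "django_redis.client.DefaultClient",
        },
    }
}
```

Django 4.0 and later also ship a built-in backend, `django.core.cache.backends.redis.RedisCache`, which needs no third-party cache package.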

10:47.2

There's django-redis and django-redis-cache, and I can never remember what the difference is between these

10:52.4

two ones every single time i start a new project i have to go and search history what did i use

10:57.3

last time and is it still as good yeah well i was just updating uh my awesome-django repo which has

11:03.1

a bunch of curated third-party apps and i was going through the exact same thing because you

11:07.6

know there's a redis section and i was like what is the difference there is a difference but it's

11:12.4

i can't remember either i have no idea like i so i i was looking this up before we started the talk

11:17.6

last time i did it i used django-redis-cache i've been very happy with that it turns out i've used

11:21.5

that loads of times in the past, but I've also used

11:23.4

Django Redis loads of times in the past, and I have

11:25.4

no idea why. I don't know which one's

11:27.6

good. Just don't peek

11:29.5

under the rug. There was some talk about

11:31.4

bringing a Redis

11:33.2

cache backend into core.

11:35.9

I think the general, the state

11:37.4

of play on that is, yeah, we

11:39.4

are keen on that, but it needs

11:41.3

a Django enhancement proposal, a DEP,

11:43.3

it needs someone to step up and write the

11:45.4

thing. But in principle,

11:47.9

in a, you know, two, three, four

11:49.4

versions time when someone's actually got around and

11:51.1

written it there might be a redis cache back end in django itself yeah because it it really

11:57.8

is on a decent sized site pretty much a guarantee you're going to have redis or memcache but

12:04.7

probably redis these days yeah and you do want cache i mean like you know just the one thing

12:09.7

we haven't talked about it's not just the pages but the template fragments sometimes templates

12:13.0

are computationally expensive to render and if you've got i don't know let's say you're converting

12:18.2

user-submitted markdown to html okay first of all you've got to render that as html using

12:25.6

markdown and then you've got to run it through a sanitizer like bleach which uses html5lib which

12:30.3

is not necessarily the fastest library in the whole world you if you can cache the output of

12:36.4

that rendering then the next time you have to do it i mean you could cache it in the database say

12:40.4

say you've got that markdown stored in a model field you could have an extra model field for it

12:44.5

for the rendered html you could do it at save time but equally you might do it by caching the

12:48.9

template fragment yeah and i'm thinking this would be a great tutorial to do because for local

12:53.3

development just so folks can see that this actually works if you just have django debug

12:57.6

toolbar which in addition to showing queries will show local page load time which again isn't a

13:02.5

proxy for production but it gives you some sense if you just flip around the switches for per site

13:08.3

per view and just see how much faster it is i mean it is orders of magnitude faster obviously to serve

13:13.6

from a cache so i would say that would be the way to play around with it is just just simply

13:18.3

django debug toolbar and then you can there's more complex tools to see how fast in production

13:24.5

your pages are yeah realistically speaking if you've used one of these apm tools these

13:28.9

these profilers these these live production profilers that monitor your execution time

13:34.4

you will see that the number one place where you're losing time is trips to the database and

13:38.4

the number two place where you're losing time is rendering templates so yeah you know if you can

13:44.1

well after you know doing something stupid with the front end not stupid but doing something with

13:49.4

front end assets like huge images or something oh right okay but okay so here's here's the

13:54.2

interesting thing with caching right is is this actual time the time your django application took

13:59.3

to serve the response versus the perceived time that the user had on the endpoint so you know

14:04.0

let's say your Django application took 300 milliseconds to go to the database, render the

14:10.2

template, serve the response. You know, is that fast? Is that slow? Who knows? But let's say

14:14.7

you're loading, you know, two megabytes of JavaScript, which took two and a half seconds to

14:19.1

be responsive and to load on the client. The client isn't going to notice if you halve your

14:24.3

response time from your Django application. They're just not going to notice because it

14:28.1

pales into insignificance. So quite often you'll see the front end people talk about this a lot.
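
The arithmetic behind that point, as a small worked example. It assumes the two costs simply add up one after the other, which is a simplification; the 300 ms and 2.5 s figures are the ones quoted above.

```python
def total_ms(server_ms, frontend_ms):
    """Perceived load time if the server response and the front-end work run back to back."""
    return server_ms + frontend_ms


before = total_ms(server_ms=300, frontend_ms=2500)  # 2800 ms
after = total_ms(server_ms=150, frontend_ms=2500)   # Django response time halved

saving_pct = 100 * (before - after) / before
print(f"{before} ms -> {after} ms ({saving_pct:.1f}% faster)")
# prints "2800 ms -> 2650 ms (5.4% faster)"
```

Halving the Django response time saves 150 ms out of 2,800 ms, which is about 5% of what the user actually waits for.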

14:33.7

the dominant factor in perceived responsiveness is how fast your page loads for the user so i.e. how

14:39.5

much javascript yeah perceived how much how many images you i mean the images aren't even the thing

14:44.0

it's javascript how much javascript are you loading how long does that take to pull into the

14:47.6

page especially if you're doing one of these um single page applications these client-side rendered

14:53.8

things where it's got to load all the javascript then it's got to pull the data from from an api

14:58.5

and that's the bit where your django app does its thing and it takes 300 milliseconds and then it's

15:02.7

got to render all that into the page before the user says oh yeah the page loaded right and that

15:07.2

whole perceived time I mean it reminds me of so Instagram back when it came out because I was

15:13.0

actually working at Quizlet like just next door to them one of the things besides filters one of

15:20.0

the things that I remember being a wow moment was so this was still when the cell reception in San

15:25.4

Francisco was terrible in a lot of places what they did is as soon as you picked an image you

15:31.5

wanted to load and start typing in all the information in the background, they started

15:34.5

loading it. So it felt really fast. You didn't, you know, press the button and then wait 5-10

15:39.7

seconds, it felt instantaneous. And I'm sure some others had done it. But that was one of the first

15:44.0

apps I saw that, you know, basically said, we're gonna, we're gonna blow up your bandwidth,

15:48.3

or your cell phone plan, but it, you know, in the background, and now that's a standard process,

15:52.3

anytime, I don't know, Tumblr or something, or Instagram, still, you know, when you're loading

15:56.4

an image, first thing you do is you load the image, and then you type in a whole bunch of

15:59.7

stuff in the background it's already processing so you can just click the button and go load

16:04.1

it yeah and I think the reality of you know you can you can Django gives you these caching

16:09.5

tools and you can use it to speed up your Django response but in a lot of cases the

16:14.0

real work is on the front end and you know do the basics in Django, get it performant,

16:18.6

use that nginx caching layer that we talked about but don't sit there then worrying about

16:22.8

micro optimizations when you've got a front-end app that's likely to offer better return on

16:29.9

investment for that optimization time right and i guess the last major point i would say is it

16:35.5

really it it depends it depends on the type of app that you have how often is the data updated

16:41.8

is it personalized for every user so if you think of facebook you know every you and i log not that

16:48.7

i have facebook um but if i did you and i log in there's different content being loaded there

16:53.2

i'm sure i know that in the background facebook is periodically loading those things into cache

16:58.7

so when you log in it's there but how often does that change if you have a timeline feed or twitter

17:03.4

right i mean that's updating quite a lot so that would be a little more challenging than

17:07.5

a blog or something that doesn't update as much where you can be a lot more

17:11.0

aggressive with the time limits that you uh yeah yeah exactly it's it's like how aggressively can

17:17.8

you cache it, like for a static blog post, and do you have a mechanism for invalidating it, right
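
One such mechanism, sketched in plain Python rather than with Django's cache API: store the rendered post under a predictable key and delete that key whenever the post is saved. The `Post` class, the key names, and the dict standing in for Redis are all hypothetical.

```python
cache = {}  # hypothetical stand-in for Redis or Memcached


def post_cache_key(post_id):
    # A predictable key lets the save handler find the cached entry later.
    return f"blog:post:{post_id}"


class Post:
    def __init__(self, post_id, body):
        self.post_id = post_id
        self.body = body

    def save(self):
        # ...the write to the database would happen here...
        cache.pop(post_cache_key(self.post_id), None)  # invalidate this post
        cache.pop("blog:index", None)                  # the index page lists it too


def render_post(post):
    key = post_cache_key(post.post_id)
    if key not in cache:
        # Stand-in for the expensive database query and template render.
        cache[key] = f"<article>{post.body}</article>"
    return cache[key]


post = Post(1, "hello")
original = render_post(post)   # rendered once, then cached
post.body = "edited"
stale = render_post(post)      # still the old HTML: the cache was not told
post.save()
fresh = render_post(post)      # re-rendered after invalidation
```

Without the `save()` call the edited post keeps serving the old HTML until the entry expires, which is why the key has to be something the save handler can reconstruct.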

17:22.7

so let's say you've cached a blog post in your you know redis whatever using the django backend

17:28.3

are you able to identify that by key such that when you update it you can you know use you can

17:37.1

when you in your save handler wherever you put that save handler you can say oh and invalidate

17:41.2

the cache so there's two problems in computer science right: naming things, cache invalidation,

17:46.2

and off by one errors yeah yeah well and i guess the last point i would make is the cache is not

17:51.8

an infinite supply it's not the database so often you are finding yourself you're like well i'll

17:56.7

just cache everything all the time but um it's more expensive than uh database space yeah so

18:03.2

this but this is where file system caching comes back into its own right because everyone's like

18:06.7

right let's go straight from ram well ram can get expensive but file system can be cheap and

18:12.9

you know it's this it so there's this um you know when there's this idea about the different

18:20.3

latencies of different things you know l1 cache blah blah blah all the way down to yeah um reading

18:25.4

something off the disk and or getting something over the network it's like how far up that scale

18:31.6

can you move your relevant thing it's just a question of thinking about you know your requirements

18:35.2

and your performance things and all the rest, you know, it's like algorithm design all over again.
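
Django's cache framework includes a file-based backend for the disk option mentioned here. A sketch of the settings fragment, where the directory path and the one-hour timeout are assumptions chosen for illustration:

```python
# settings.py (fragment)

CACHES = {
    "default": {
        "BACKEND": "django.core.cache.backends.filebased.FileBasedCache",
        # Assumption: a directory the web server's user is allowed to write to.
        "LOCATION": "/var/tmp/django_cache",
        "TIMEOUT": 60 * 60,  # seconds before an entry is considered stale
    }
}
```

Disk reads are slower than RAM, but a cache like this can hold far more than would be affordable in memory.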

18:40.0

Yeah. And it's, I mean, again, it's for an engineering mind, it's sort of fun because

18:43.3

it feels black and white and you can see your progress. Um, a last point I would

18:47.2

mention, so there's a, there's a book that's a couple of years old at this point, but still

18:50.8

very relevant called High Performance Django by the folks at Lincoln Loop that, uh, talks about

18:56.4

caching, but talks about a lot of these performance things, 'cause this all comes down to performance. So

19:00.6

that's definitely worth a look. Um, we'll put the link for that in the show notes. Um, so yeah,

19:05.2

so i was just going to say there was some i remember when i was learning back in the day

19:09.5

and there was some there was some really good books on this sort of stuff and uh you know from

19:14.7

o'reilly and you know all the rest of them i don't know what the latest published books yeah i don't

19:19.8

know what the latest scaling books are you know what the latest you know high performance web

19:23.6

applications type things books are web scalability that well if i may indulge a slight rant so i was

19:29.6

updating my awesome-django repo where i have a book section and there's still like almost no

19:34.8

up-to-date books up to date being you know actually written in the last couple years books

19:39.5

on django because it changes all the time so it's not that the advice especially around this stuff

19:43.3

is wrong but i would love to know about more up-to-date things i mean as far as i know um

19:49.3

so tango with django just released an updated book that's a classic that's been around

19:54.0

um but there's still i still think i'm almost the only one with 2.2 versions of my books

19:59.8

um so it's yeah if you think of if you find those we'll put them in the show notes but um

20:05.2

that you know it's nice in a way that that's the stuff that doesn't change as much i mean that's

20:09.0

the the challenge and opportunity for me as a content creator is i have to update things all

20:13.6

the time which can become tiring but also makes me do it better but it means that a lot of um i

20:19.9

sort of look longingly at these things like um algorithms and stuff that are more but i would

20:25.4

argue i would argue that the the principles of sort of web application scaling haven't really

20:32.0

altered in the 15 years that i've been sort of looking at it like what was true then is true

20:37.4

now and you know maybe version numbers have changed but not you know not the actual way

20:46.0

you go about it the point is though since you already know how to do it quite well that five

20:49.9

ten percent that's changed doesn't throw you off whereas for example people ask me all

20:54.2

the time what's the difference in the book between 2.0 2.1 2.2 it's about 10 15 percent actually different

21:01.8

content and if you already know django it won't throw you off but if you don't know django which

21:05.6

is why you bought the book it will definitely throw you know those those differences are fatal

21:09.6

oh what you know i've got it says 2.2 and i've got 1.8 what's going on here like you know yeah

21:15.1

i remember that i remember being in that exact position i couldn't i you know banging my

21:19.4

head on the wall for ages anyway that's a cheerful note to finish on yes all right so

21:25.3

caching it's important um hopefully this episode helped you all out we are as ever at the

21:30.5

djangochat.com website we are @chatdjango on twitter the episodes actually are also on youtube the audio

21:35.9

only if you prefer that i keep putting them up there and there are some subscribers but if you

21:40.0

prefer youtube uh go check it out we'll see you all next time bye