Pelikan Cache (@pelikan_cache)'s Twitter Profile
Pelikan Cache

@pelikan_cache

A framework for building the world’s fastest key-value caches. Source: github.com/pelikan-io/pel…

ID: 3643720634

Link: https://pelikan.io · Joined: 22-09-2015 01:44:55

73 Tweets

578 Followers

7 Following

Pelikan Cache (@pelikan_cache):

In-memory cache design generally faces the choice of introducing internal fragmentation (allocating more than used) or external fragmentation (taking more memory than allocated). The former reduces effective capacity, while the latter could trigger OOM. #CachingWisdomOfTheDay
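
A minimal sketch of the internal-fragmentation half of that trade-off, using hypothetical slab size classes (the 64 B base, 1.25x growth factor, and example item size are illustrative, not Pelikan's actual configuration):

```rust
// Slab allocators round each item up to a size class: predictable memory
// use (no external fragmentation), at the cost of internal fragmentation.
fn main() {
    let growth = 1.25_f64;
    let mut classes: Vec<usize> = Vec::new();
    let mut size = 64_usize;
    while size <= 1024 * 1024 {
        classes.push(size);
        size = ((size as f64) * growth).ceil() as usize;
    }

    // An item lands in the smallest class that fits it; the gap is
    // internal fragmentation (allocated but unused bytes).
    let item = 1000_usize;
    let class = classes.iter().copied().find(|&c| c >= item).unwrap();
    let wasted = class - item;
    println!(
        "item {} B -> class {} B, {} B internal fragmentation ({:.1}%)",
        item, class, wasted,
        100.0 * wasted as f64 / class as f64
    );
}
```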

Pelikan Cache (@pelikan_cache):

Does a cache have to keep data in DRAM to perform? The answer turns out to be highly workload- and design-dependent. The performance gap between DRAM and SSD (or PMEM) continues to be much bigger for random writes than for reads. For many reads, SSD is fast enough. #CachingWisdomOfTheDay

Pelikan Cache (@pelikan_cache):

It’s easier to design data structures for a cache than for a database, given the very loose consistency and relatively forgiving availability requirements. But even so, perfect generic data designs don’t exist. It’s often more valuable to specialize by workload. #CachingWisdomOfTheDay

Pelikan Cache (@pelikan_cache):

What can you do when you can consistently log every command except the values to (a slice of) your cache backend? A lot. Hot keys, heat maps, working set size and miss rate estimates… In fact, most of the cache config parameters can be correctly inferred. #CachingWisdomOfTheDay
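
A minimal sketch of that idea, assuming a keys-only command log (the log entries and key names below are made up): even without values, first-ever accesses give a compulsory-miss lower bound on miss rate, and the distinct-key count is a crude working set proxy:

```rust
use std::collections::HashSet;

fn main() {
    // Hypothetical keys-only log: (command, key), values never recorded.
    let log = [
        ("get", "user:1"), ("get", "user:2"), ("set", "user:2"),
        ("get", "user:1"), ("get", "user:3"), ("get", "user:2"),
    ];

    let mut seen: HashSet<&str> = HashSet::new();
    let (mut gets, mut cold_misses) = (0u64, 0u64);
    for (cmd, key) in log {
        if cmd == "get" {
            gets += 1;
            // A first-ever access can never be a hit, so cold misses give
            // a lower bound on miss rate (evictions/expiry would add more).
            if !seen.contains(key) {
                cold_misses += 1;
            }
        }
        seen.insert(key);
    }

    // Distinct keys seen is a crude proxy for working set size.
    println!("distinct keys: {}", seen.len());
    println!("miss ratio >= {:.2}", cold_misses as f64 / gets as f64);
}
```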

Pelikan Cache (@pelikan_cache):

The threading model is especially important for secure cache access. While modern hardware has made encryption much faster, TLS frames add memory & CPU overhead. Worse still, a TLS handshake is 10x more expensive than a TCP connect, which makes it a serious threat to SLOs. #CachingWisdomOfTheDay

Pelikan Cache (@pelikan_cache):

Cache writes are not always more expensive than reads. Simple reads and writes can achieve similar RPS because RPC costs dominate. There are some “tipping points” wrt RPC, e.g. you can expect a sudden dip in RPS at the MTU, socket buffer, and/or request buffer sizes. #CachingWisdomOfTheDay
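
A back-of-the-envelope sketch of one such tipping point: once a response outgrows a single MTU-sized frame, it costs extra packets (and often extra syscalls). The header sizes below assume plain IPv4 + TCP and are purely illustrative:

```rust
fn main() {
    const MTU: usize = 1500;   // typical Ethernet MTU
    const HEADERS: usize = 40; // IPv4 + TCP headers, no options
    let payload_per_frame = MTU - HEADERS;

    // Crossing the per-frame payload boundary doubles the packet count,
    // which is where the sudden RPS dip shows up.
    for resp in [512, 1400, 1460, 1461, 4096, 65536] {
        let frames = (resp + payload_per_frame - 1) / payload_per_frame;
        println!("{:>6} B response -> {} frame(s)", resp, frames);
    }
}
```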

Pelikan Cache (@pelikan_cache):

Always have a TTL for anything put in cache. To list a few reasons:
- it provides a well-defined bound on data inconsistency;
- it gives the cache backend a strong hint on how to retain useful data for the right duration;
- it helps you stay GDPR-compliant.
#CachingWisdomOfTheDay
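
A minimal sketch of enforcing the rule, as a toy cache whose only write path takes an explicit TTL (this is illustrative, not Pelikan's API):

```rust
use std::collections::HashMap;
use std::time::{Duration, Instant};

// Toy cache: every entry carries an absolute expiry time.
struct TtlCache {
    map: HashMap<String, (Vec<u8>, Instant)>,
}

impl TtlCache {
    fn new() -> Self {
        Self { map: HashMap::new() }
    }

    // There is deliberately no TTL-less set(): every caller must decide
    // how stale is acceptable, which bounds inconsistency and guarantees
    // the data eventually expires.
    fn set(&mut self, key: &str, value: Vec<u8>, ttl: Duration) {
        self.map.insert(key.to_string(), (value, Instant::now() + ttl));
    }

    fn get(&mut self, key: &str) -> Option<&[u8]> {
        let expired = match self.map.get(key) {
            Some((_, expiry)) => *expiry <= Instant::now(),
            None => return None,
        };
        if expired {
            self.map.remove(key); // lazy expiration on read
            return None;
        }
        self.map.get(key).map(|(v, _)| v.as_slice())
    }
}

fn main() {
    let mut cache = TtlCache::new();
    cache.set("user:42", b"profile".to_vec(), Duration::from_secs(3600));
    assert!(cache.get("user:42").is_some());
}
```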

Pelikan Cache (@pelikan_cache):

In a cache with data structures, writes tend to be incremental. It’s important to keep memory operations proportional to the update size instead of the object size. A DIMM carries ~40 GB/s. If updates against 1 MB objects memcpy the whole object, throughput will be <40K RPS. #CachingWisdomOfTheDay
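
A quick worked version of that arithmetic, contrasting copy-on-update with an in-place append (the 64 B delta is illustrative):

```rust
fn main() {
    let object_size = 1 << 20; // 1 MB cached object
    let dimm_bw: f64 = 40e9;   // ~40 GB/s per DIMM, from the tweet

    // Copy-on-update: every append rewrites the full object, so memory
    // bandwidth / object size caps throughput at ~40K updates/s.
    let max_rps_copy = dimm_bw / object_size as f64;
    println!("copy whole object: <= {:.0} updates/s", max_rps_copy);

    // Incremental update: only the appended bytes are written.
    let delta = 64; // 64 B appended per update
    let max_rps_incremental = dimm_bw / delta as f64;
    println!("append in place:   <= {:.0} updates/s", max_rps_incremental);
}
```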

Pelikan Cache (@pelikan_cache):

Accessing a large in-memory dataset is unlikely to be CPU-cache friendly, even if it has a fairly skewed popularity distribution. What could/should be kept in CPU caches? IO buffers. How? CPU pinning, flow steering, and keeping storage threads away. #CachingWisdomOfTheDay
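
A minimal sketch of the pinning part, assuming the third-party core_affinity crate (the one-IO-core/rest-storage split is hypothetical):

```rust
use std::thread;

fn main() {
    // Enumerate available cores via the core_affinity crate.
    let cores = core_affinity::get_core_ids().expect("no core ids");

    // Hypothetical split: one core for IO, the rest for storage work.
    let (&io_core, storage_cores) = cores.split_first().expect("need >= 1 core");

    let io = thread::spawn(move || {
        core_affinity::set_for_current(io_core);
        // ... run the event loop here; IO buffers stay warm in this
        // core's caches instead of bouncing between cores.
    });

    let storage: Vec<_> = storage_cores
        .iter()
        .copied()
        .map(|core| {
            thread::spawn(move || {
                core_affinity::set_for_current(core);
                // ... storage work, kept off the IO core so it does not
                // evict the IO buffers.
            })
        })
        .collect();

    io.join().unwrap();
    for h in storage {
        h.join().unwrap();
    }
}
```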

Pelikan Cache (@pelikan_cache):

You're probably familiar with read vs. write. But also consider:
- idempotent vs. non-idempotent writes: the former can be safely retried, the latter cannot;
- regular vs. privileged commands: e.g. you probably don't want people to run FLUSH_ALL willy-nilly.
#CachingWisdomOfTheDay
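
A minimal sketch of acting on the idempotency distinction: a retry wrapper that only retries commands known to be safe to apply twice (the Command enum and send() stub are hypothetical, not Pelikan's protocol):

```rust
#[derive(Clone, Copy)]
enum Command<'a> {
    Set(&'a str, &'a [u8]), // idempotent: applying twice gives the same state
    Delete(&'a str),        // idempotent
    Incr(&'a str, u64),     // NOT idempotent: a blind retry may double-count
}

impl Command<'_> {
    fn is_idempotent(&self) -> bool {
        matches!(self, Command::Set(..) | Command::Delete(..))
    }
}

// Stand-in for the network send; real code would return richer errors.
fn send(_cmd: Command) -> Result<(), ()> {
    Err(()) // pretend the connection flaked
}

fn execute(cmd: Command, max_retries: u32) -> Result<(), ()> {
    // After an ambiguous failure we cannot know whether a non-idempotent
    // write was already applied, so those get exactly one attempt.
    let attempts = if cmd.is_idempotent() { 1 + max_retries } else { 1 };
    let mut last = Err(());
    for _ in 0..attempts {
        last = send(cmd);
        if last.is_ok() {
            break;
        }
    }
    last
}

fn main() {
    let _ = execute(Command::Set("k", b"v"), 3);     // retried up to 3 times
    let _ = execute(Command::Incr("counter", 1), 3); // exactly one attempt
}
```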

Pelikan Cache (@pelikan_cache):

Friends don't let friends enable dynamic scripting in cache. All it takes is one bug or a poorly implemented algorithm to completely kill server performance. Scripting is useful, but it should be treated like any other server code: tested, benchmarked, and deployed carefully. #CachingWisdomOfTheDay

Pelikan Cache (@pelikan_cache):

Almost nowhere else in infrastructure software is the following statement more true than it is for caching: "Service delayed is service denied." Make it fast. Make it predictably fast. Make it consistently fast. #CachingWisdomOfTheDay

Pelikan Cache (@pelikan_cache):

We know our documentation has a long way to go, so we are spending September, heads-down, writing about the design of Pelikan. If you want to know about anything in particular, please tell us in the replies. In October, we'll return with a daily tweet series on Rust + Pelikan.

Open Source Startup Podcast🎙 (@ossstartup):

🎙️E49 of the Open Source Startup Podcast is LIVE🎙️ Ft. Daniela Miao, Cofounder of caching platform @MomentoHQ & Yao Yue 岳峣 from the Twitter team for Pelikan Cache ✨Check it out✨ anchor.fm/ossstartuppodc…

Pelikan Cache (@pelikan_cache):

Yep, we agree. betterprogramming.pub/software-compo… Corollary: If you do not know what *exactly* Pelikan Cache is, don't worry. It's by design.

Khawaja Shams (@ksshams):

I am still amazed at how easy it was to run Pelikan Cache on Ampere Altra-based Tau T2A VMs on Google Cloud. Ali Zhairati was able to make it work instantly and make it fly in a matter of hours! 🙇‍♂️

Yao Yue 岳峣 (@thinkingfish):

If you are attending QCon SF, check out Juncheng (Juncheng Yang)'s talk on Ubiquitous Caching for the "Building Modern Backend" track at 1:40pm, where he discusses hardware trends, emerging workloads, and running large in-process caches using Segcache. qconsf.com/presentation/o…

Juncheng Yang (@1a1a11a):

I talked about S3-FIFO at #SOSP today. S3-FIFO is a FIFO-based eviction algorithm that is simple, scalable, and efficient. For example, it can reduce the miss ratio of your LRU caches by up to 72% and improve throughput by 6x at 16 threads. S3-FIFO is being tried out at a few
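
A compact sketch of S3-FIFO following the paper's description: a small FIFO (~10% of capacity) absorbs one-hit wonders, a main FIFO holds re-referenced objects, and a ghost FIFO remembers keys recently evicted from the small queue. This toy tracks keys only; the counter cap and promotion threshold follow the published pseudocode, but details may differ from any production implementation:

```rust
use std::collections::{HashMap, HashSet, VecDeque};

struct S3Fifo {
    capacity: usize,
    small: VecDeque<String>,     // probationary FIFO (~10% of capacity)
    main: VecDeque<String>,      // FIFO for re-referenced objects
    ghost: VecDeque<String>,     // keys evicted from small, metadata only
    ghost_set: HashSet<String>,  // fast ghost membership test
    freq: HashMap<String, u8>,   // 2-bit access counter, capped at 3
}

impl S3Fifo {
    fn new(capacity: usize) -> Self {
        Self {
            capacity,
            small: VecDeque::new(),
            main: VecDeque::new(),
            ghost: VecDeque::new(),
            ghost_set: HashSet::new(),
            freq: HashMap::new(),
        }
    }

    // Returns true on hit, false on miss (which inserts the key).
    fn access(&mut self, key: &str) -> bool {
        if let Some(f) = self.freq.get_mut(key) {
            *f = (*f + 1).min(3); // hit: bump capped frequency
            return true;
        }
        while self.small.len() + self.main.len() >= self.capacity {
            self.evict();
        }
        self.freq.insert(key.to_string(), 0);
        if self.ghost_set.remove(key) {
            self.main.push_back(key.to_string()); // seen recently: skip small
        } else {
            self.small.push_back(key.to_string());
        }
        false
    }

    fn evict(&mut self) {
        // Keep the small queue at roughly 10% of total capacity.
        if self.small.len() * 10 >= self.capacity {
            self.evict_small();
        } else {
            self.evict_main();
        }
    }

    fn evict_small(&mut self) {
        while let Some(k) = self.small.pop_front() {
            if self.freq.get(&k).copied().unwrap_or(0) > 1 {
                self.main.push_back(k); // re-referenced: promote to main
            } else {
                self.freq.remove(&k); // evict, but remember the key in ghost
                if self.ghost.len() >= self.capacity {
                    if let Some(old) = self.ghost.pop_front() {
                        self.ghost_set.remove(&old);
                    }
                }
                self.ghost_set.insert(k.clone());
                self.ghost.push_back(k);
                return;
            }
        }
    }

    fn evict_main(&mut self) {
        while let Some(k) = self.main.pop_front() {
            let f = self.freq.get(&k).copied().unwrap_or(0);
            if f > 0 {
                self.freq.insert(k.clone(), f - 1);
                self.main.push_back(k); // second chance, frequency decayed
            } else {
                self.freq.remove(&k); // evict for good
                return;
            }
        }
    }
}

fn main() {
    let mut cache = S3Fifo::new(4);
    for k in ["a", "b", "a", "c", "d", "e", "a", "f", "g", "a"] {
        let hit = cache.access(k);
        println!("{k}: {}", if hit { "hit" } else { "miss" });
    }
}
```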