分布式架构的根基：深入浅出一致性算法

原創

2021-05-21 12:03

{"type":"doc","content":[{"type":"blockquote","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"italic","attrs":{}}],"text":"分布式算法的介绍文章可谓汗牛充栋，但或是过于学术证明或是过于简单，笔者将尝试挑战用一篇文章，让近乎0基础的同学都可以理解一致性算法的原理。","attrs":{}}]}],"attrs":{}},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"分布式服务的困局","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"我们试想一个常见的电商场景：超时订单自动关闭，在下单后X小时内未支付的话自动关闭订单并释放库存。这时我们需要有一个定时器定时触发相关的业务操作，从高可用的角度看这个定时器需要部署多个实例，但对同一订单仅只允许触发一次。要实现这个需求有多种方案，最常见的就是集群领导者选举，可以以实例或订单组为维度选出领导者并由其负责执行特定订单的触发。领导者选举有着广泛的应用场景，我们可以尝试将之抽象成独立的服务。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https://static001.geekbang.org/infoq/3f/3fd7108ca6b0fca74fe7ac83fb9ac647.jpeg","alt":null,"title":null,"style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"如上图，实现非常简单：","attrs":{}}]},{"type":"bulletedlist","content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"创建一个","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"领导者选举服务","attrs":{}}],"attrs":{}},{"type":"text","text":"，使用CAS原子化地设置变量 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"leader","attrs":{}}],"attrs":{}},{"type":"text","text":" 其值等于对应的实例Id， ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"leader","attrs":{}}],"attrs":{}},{"type":"text","text":" 的值在一定的存活周期后自动销毁以避免服务实例不可达导致没有可用的领导者","attrs":{}}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"多个订单服务向领导者选举服务定期提交请求，希望将自己设置成领导者","attrs":{}}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"领导者选举服务中 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"leader","attrs":{}}],"attrs":{}},{"type":"text","text":" 的值如果存在则直接返回，反之根据先到先得原则设置 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"leader","attrs":{}}],"attrs":{}},{"type":"text","text":" 的值为对应的实例Id并返回","attrs":{}}]}]}],"attrs":{}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"但这个方案的问题也很明显：“领导者选举服务”单点了，一个节点挂了会导致服务不可用。那么能用多实例吗？","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https://static001.geekbang.org/infoq/8c/8cff7aa4bfcae421b2e2cf215a438827.jpeg","alt":null,"title":null,"style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"如上图，这是两种多实例扩展的方案。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"方案一下不同的订单实例会随机路由到不同的领导者选举服务实例，再由领导者选举服务各实例自身实现数据同步。那么怎么同步？当然可以使用数据库、Redis等中间件实现，但这导致了该服务并不纯粹，我们希望这个服务不依赖于三方服务或中间件。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"方案二下要求订单服务向所有领导者选举服务实例发送请求，只要有一个领导者选举服务实例存活服务整体就还是可用的，但这一方案的问题在于请求发送与接收存在网络时延，导致不同领导者选举服务实例收到的顺序可能是不一样的，进而无法形成统一的结果。而这正是我们所面临的最棘手的一致性问题。","attrs":{}}]},{"type":"heading","attrs":{"align":null,"level":3},"content":[{"type":"text","text":"一致性算法","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"分布式架构涉及很多方面的知识，但如果要刨根问底，探寻根基的话，那么一定是","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"一致性（Consistency）算法","attrs":{}}],"attrs":{}},{"type":"text","text":"无疑了。一致性算法是分布式架构的基础，为节点伸缩提供了核心保障。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"如何在多个实例中选择领导者，如何实现数据多副本存储，如何设计分布式锁，如何确定全局ID……这一切的基石都在于一致性保障。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"一致性算法很晦涩难懂，一代IT人试图用各种方式“深入浅出”地介绍一致性问题的算法实现，但真正能被大众接受的少之又少。接下来不先不介绍一致性的算法派系，也不去做严格的算法推导，我们先从问题出发，逐步深入。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"我们回顾上文遇到的问题：网络的时延导致各实例接收到请求的顺序可能各不相同，那么我们是否可以从时序保证上入手呢？比如将所有请求先发到MQ，再由MQ分发请求，这的确可以解决，但是要知道的是MQ本身也需要一致性支持，这是就先有鸡还是先有蛋的问题了，所以去要求严格的消息时序以解决一致性问题是不可能的。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"我们大致上总结一下分布式数据一致性要解决的问题：","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"在多实例中确定一个变量的值，一旦确定后只能获取不能修改。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"比如上文的领导者选举就是为确定 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"leader","attrs":{}}],"attrs":{}},{"type":"text","text":" 变量的值，所谓“确定”即要求领导者选举服务同一时间周期内（比如一个选举周期）对外输出的领导者是唯一的，即要求领导者选举服务的不同实例间对 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"leader","attrs":{}}],"attrs":{}},{"type":"text","text":" 的取值达成共识（同一时间周期内不能订单服务inst1拿到的是 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"leader = inst1","attrs":{}}],"attrs":{}},{"type":"text","text":" ，订单服务inst2拿到的是 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"leader = inst2","attrs":{}}],"attrs":{}},{"type":"text","text":" ），并且这同一时间周期内确定的值不能被更改（同一时间周期内在确定 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"leader = inst1","attrs":{}}],"attrs":{}},{"type":"text","text":" 的前提下订单服务inst3发起选举不可以更改 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"leader","attrs":{}}],"attrs":{}},{"type":"text","text":" 的取值）。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"我们将确定变量值的过程看做“投票”，为规范后续的用词，我们先做以下简单的定义：","attrs":{}}]},{"type":"bulletedlist","content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"Proposer 提案人，发起提案的请求方，比如上文的各订单服务实例","attrs":{}}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"Acceptor 投票人，负责对提案发起投票，比如上文的各领导者选举服务实例","attrs":{}}]}]}],"attrs":{}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"那么我们思考下这投票的过程中带来的实现难点：","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"难点一：存在多个Proposer提案人并发请求导致接收到投票的时序无法保证","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"要解决这个难点最直接的做法是加锁，如下图；","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https://static001.geekbang.org/infoq/c3/c31a05c3a25599b3884f83f4142db96c.jpeg","alt":null,"title":null,"style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"我们将原本一步完成的操作分成两个步骤：","attrs":{}}]},{"type":"bulletedlist","content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"准备阶段：Proposer向Accepters发起加写锁的请求Accepter收到请求返回成功或失败（已被加过锁）","attrs":{}}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"投票阶段：Proposer在收到所有Accepter都加锁成功时发起投票各Accepter同意投票结果，形成确定性取值各Accepter释放锁","attrs":{}}]}]}],"attrs":{}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"我们暂不考虑算法的效率问题，这样的确可解决时序问题，但这要求所有Accepter都能响应请求显然是不合理的。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"难点二：部分Accepter故障时仍然可用","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"要解决这个问题我们只要修改加锁成功的条件为“半数以上”，如下图：","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https://static001.geekbang.org/infoq/d4/d4fe584e3ce9c73e8f2b7ed7ce585143.jpeg","alt":null,"title":null,"style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"这样服务可做到2F+1的容错能力，即在2F+1个Accepter实例的服务中最多允许F个Accepter实例同时出现故障。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"既然Accepter允许故障，那么Proposer也应如此，但这述算法中如果某Proposer实例获取到锁后发生了故障即会引发死锁导致服务不可用。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"诚然我们可以为锁加过期时间（由Proposer指定本次锁的到期时间以确保可以在同一时间释放），但这样做以及加锁本身对服务的可用性/性能影响都比较大。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"难点三：任意Proposer故障时仍然可用","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"我们推演到现在，使用锁的方式遇到了严重的挑战，但我们可以按上述两阶段投票的方式进行改进。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"由于放弃加锁的方式，那么就不得不去直面并发请求带来的时序问题，首先想到的应该是为投票的提案带上时间戳以区别提案的前后时间。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"我们先以只有一个Accepter的情况分析，如下图：","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https://static001.geekbang.org/infoq/80/804e5b5fa42780d6dab19c3d40a18cb2.jpeg","alt":null,"title":null,"style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"对于Proposer1而言，它的流程如下：","attrs":{}}]},{"type":"numberedlist","attrs":{"start":null,"normalizeStart":1},"content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":1,"align":null,"origin":null},"content":[{"type":"text","text":"Proposer1发起了提案号为 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"n1","attrs":{}}],"attrs":{}},{"type":"text","text":" 的提案请求，这里要求提案号是有序递增的，多可使用时间戳组成","attrs":{}}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":2,"align":null,"origin":null},"content":[{"type":"text","text":"Acceptor收到了提案请求，将自身的 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"maxN","attrs":{}}],"attrs":{}},{"type":"text","text":" （收到的最大提案号）修改成 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"n1","attrs":{}}],"attrs":{}},{"type":"text","text":" ，并承诺不接收小于等于 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"maxN","attrs":{}}],"attrs":{}},{"type":"text","text":" 提案号的请求","attrs":{}}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":3,"align":null,"origin":null},"content":[{"type":"text","text":"返回提案请求允可","attrs":{}}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":4,"align":null,"origin":null},"content":[{"type":"text","text":"Proposer1正式发起提案，内容为之前的提案号及提案的值","attrs":{}}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":5,"align":null,"origin":null},"content":[{"type":"text","text":"Acceptor收到了提案，将自身的 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"acceptN","attrs":{}}],"attrs":{}},{"type":"text","text":" （接受的提案号）更新为 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"n1","attrs":{}}],"attrs":{}},{"type":"text","text":" 、 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"acceptV","attrs":{}}],"attrs":{}},{"type":"text","text":" （接受的提案值，即确定的值）设置成 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"v1","attrs":{}}],"attrs":{}},{"type":"text","text":" , 并承诺不处理小于 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"maxN","attrs":{}}],"attrs":{}},{"type":"text","text":" 提案号的提案","attrs":{}}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":6,"align":null,"origin":null},"content":[{"type":"text","text":"返回提案成功","attrs":{}}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"对于Proposer2而言，它的流程如下：","attrs":{}}]},{"type":"numberedlist","attrs":{"start":null,"normalizeStart":1},"content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":1,"align":null,"origin":null},"content":[{"type":"text","text":"Proposer2发起了提案号为 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"n2","attrs":{}}],"attrs":{}},{"type":"text","text":" 的提案请求","attrs":{}}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":2,"align":null,"origin":null},"content":[{"type":"text","text":"Acceptor收到了提案请求，将自身的 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"maxN","attrs":{}}],"attrs":{}},{"type":"text","text":" 修改成 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"n2","attrs":{}}],"attrs":{}}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":3,"align":null,"origin":null},"content":[{"type":"text","text":"由于已形成了确定性值，所以直接返回已确定的值","attrs":{}}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"从上面流程中可见，值的确定性是由 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"后者认可前者","attrs":{}}],"attrs":{}},{"type":"text","text":" 的原则保障，只要有确定性的值，后续的提案都会认可这个值。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"我们再看复杂些的情况：","attrs":{}}]},{"type":"image","attrs":{"src":"https://static001.geekbang.org/infoq/16/16a20b1abe83071b1617f4b8c5e1a6df.jpeg","alt":null,"title":null,"style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"如上图，Proposer1与Proposer2交叉执行，它们的流程如下：","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"p1-1.2.3. 同前面流程p2-1. 此时Proposer2发起了提案号为 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"n2","attrs":{}}],"attrs":{}},{"type":"text","text":" 的提案请求p2-2. Acceptor收到了提案请求，将自身的 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"maxN","attrs":{}}],"attrs":{}},{"type":"text","text":" 修改成 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"n2","attrs":{}}],"attrs":{}},{"type":"text","text":"p2-3. 返回提案请求允可p1-4. 此时Proposer1正式发起提案，提案号 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"n1","attrs":{}}],"attrs":{}},{"type":"text","text":" 提案值 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"v1","attrs":{}}],"attrs":{}},{"type":"text","text":"p1-5. 由于已有更大的提案号 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"maxN = n2","attrs":{}}],"attrs":{}},{"type":"text","text":" ，所以返回错误p2-4. 此时Proposer2正式发起提案，提案号 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"n2","attrs":{}}],"attrs":{}},{"type":"text","text":" 提案值 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"v2","attrs":{}}],"attrs":{}},{"type":"text","text":"p2-5. Acceptor收到了提案，将自身的 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"acceptN","attrs":{}}],"attrs":{}},{"type":"text","text":" 更新为 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"n2","attrs":{}}],"attrs":{}},{"type":"text","text":" 、 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"acceptV","attrs":{}}],"attrs":{}},{"type":"text","text":" 设置成 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"v2","attrs":{}}],"attrs":{}},{"type":"text","text":"p2-6. 返回提案成功p1-6. 由于Proposer1的第一次提案没有通过，所以增加提案号后重新发起提案申请，提案号为 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"n3","attrs":{}}],"attrs":{}},{"type":"text","text":"p1-7. Acceptor收到了提案请求，将自身的 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"maxN","attrs":{}}],"attrs":{}},{"type":"text","text":" 修改成 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"n3","attrs":{}}],"attrs":{}},{"type":"text","text":"p1-8. 由于前面已经形成了确定性值，所以直接返回之前的提案值","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"从上面流程中可见，Proposer2可以抢占Proposer1的提案权，即后发起的提案在未形成确定性值时可以抢占现有的提案权。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"至此，我们可以容忍任意Proposer的故障，那么存在多个Acceptor时又如何呢？","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"实际上，Acceptor做的事与前面单一Acceptor场景一样，核心在于确保Proposer向所有的Acceptor发起请求，仅当超半数Acceptor返回成功时才算请求成功，否则重试。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https://static001.geekbang.org/infoq/e0/e0502ec07f50e461946f8a27dcd4b89e.jpeg","alt":null,"title":null,"style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"上图略显复杂，我们逐步分析：","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"p1-1-6. 同前面流程p2-1-9. 抢占式提案，使当前各Acceptor的 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"maxN","attrs":{}}],"attrs":{}},{"type":"text","text":" 修改成 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"n2","attrs":{}}],"attrs":{}},{"type":"text","text":"p1-7.8. Proposer1向Acceptor3（网络时延）发起了提案请求，但在提案请求阶段Acceptor不接受 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"⇐ maxN","attrs":{}}],"attrs":{}},{"type":"text","text":" 的提案号，故返回错误p1-9-12. 由于超半数Acceptor返回成功（前一幅图），可以提交提案，但在提案提交阶段Acceptor不接受 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"< maxN","attrs":{}}],"attrs":{}},{"type":"text","text":" 的提案号，故返回错误p2-10-14. 此时Proposer2超半数Acceptor返回成功，可以提交提案，由于提案请求返回中都没有确定性值时，故使用Proposer2预设的值v2提交，超半数提案提交成功，故已形成确定性值p1-13-21. Proposer1更新提案号重新发起提案请求，各Acceptor更新 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"maxN","attrs":{}}],"attrs":{}},{"type":"text","text":" 为最新的提案号 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"n3","attrs":{}}],"attrs":{}},{"type":"text","text":" 并返回各自已确定的值p1-22-30. 提案请求返回中存在确定性值： ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"Acceptor1的(n2, v2) 及Acceptor2的(n2, v2) ","attrs":{}}],"attrs":{}},{"type":"text","text":"使用提案号最大的确定性值做为新提案的值，对于上例是最大提案号 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"n2","attrs":{}}],"attrs":{}},{"type":"text","text":" , 对应的值为 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"v2","attrs":{}}],"attrs":{}},{"type":"text","text":"，最终Proposer1与Proposer2都得到了确定性值 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"v2","attrs":{}}],"attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"如果各位能理解上述流程，那么恭喜你，你已经掌握了一致性算法中最著名的 ","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"Paxos算法","attrs":{}}],"attrs":{}},{"type":"text","text":" 核心。","attrs":{}}]},{"type":"heading","attrs":{"align":null,"level":3},"content":[{"type":"text","text":"Paxos:开山鼻祖","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"Paxos ，这是公认的最伟大的分布式一致算法，可能没有之一。Google的Chubby、Spanner都使用了Paxos以保证数据副本更新序列的一致性。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"Paxos协议见于Leslie Lamport在1998年发的《The Part-Time Parliament》，在此论文中他假设了一个叫Paxos的小岛，岛上的各项决定要经议会同意，议会成员都是兼职的，议员的核心角色分为提案者（Proposers）、表决/投票者（Acceptors）。Proposer 提出提案（Proposal），提案信息包括提案编号和提议的值（Value），Acceptor 收到提案后可以接受（Accept）提案，若提案获得多数 Acceptors 的接受，则称该提案被批准（Chosen）。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"此论文的描述晦涩难懂，以至于很多专业人士也一头雾水，所以Lamport在2001年又发表了《Paxos Made Simple》以简化说明，但这还是过于晦涩。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"Paxos 算法有很多变种，包含但不限于：Basic Paxos、Multi Paxos、Fast Paxos、Byzantine Paxos……","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"值得一提的是 Paxos 能容忍消息丢失（节点不可达）、乱序，但存储必须可靠（没有数据丢失和错误），即这是“非拜占庭算法”，而 Byzantine Paxos 则解决了拜占庭场景。关于拜占庭问题我们后文会介绍。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"如果上文的图示没有看懂，那么下文我们以 Basic Paxos 这一经典算法为例写伪代码进一步阐述。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"strong","attrs":{}}],"text":"Basic Paxos","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"Paxos流程描述的文章太多了，但文字的描述过于苍白，上文我们用示例加示意图的形式已经描述了其核心流程，下面我们再用伪代码的形式更严格地描述Basic Paxos的核心流程：","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https://static001.geekbang.org/infoq/84/841f289969734bd5b5945c47e323d10b.jpeg","alt":null,"title":null,"style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https://static001.geekbang.org/infoq/ba/ba64a363a36f2c81ba8fb19afb25e7a8.jpeg","alt":null,"title":null,"style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https://static001.geekbang.org/infoq/b7/b7445e72739d5e3319f13bc8c90fcc9d.jpeg","alt":null,"title":null,"style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"strong","attrs":{}}],"text":"Multi Paxos","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"从上文我们可以看到Basic Paxos，算法的过程也比较复杂，确定一个值需要至少2次RPC并且可能存在活锁（即多个Proposer交替发起提案申请，见 Wikipedia Paxos （","attrs":{}},{"type":"link","attrs":{"href":"https://link.zhihu.com/?target=https%3A//en.wikipedia.org/wiki/Paxos_%28computer_science%29","title":null,"type":null},"content":[{"type":"text","text":"https://en.wikipedia.org/wiki/Paxos_(computer_science)","attrs":{}}]},{"type":"text","text":"）的Basic Paxos when multiple Proposers conflict章节，本文不赘述），所以一般的Paxos实现都是基于Multi Paxos，它只要约一次RPC，算法复杂度也低一些。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"Basic Paxos之所以需要至少2次RPC是prepare阶段无法形成确定性取值，而其中的原因在于存在多个Proposer同时提案，所以Multi Paxos的核心思想是先为多个Proposer选举出Leader，后续所有的提案都由这个Leader发起，这样可以省略prepare阶段，直接发起accept。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"Leader选举的过程类同Basic Paxos，需要至少2次RPC，Leader确定之后即只需要1次RPC。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"读者可能会有疑问：为什么这叫 Multi Paxos？这是个好问题，Multi Paxos要解决的问题其实不止于减少RPC调用，Basic Paxos在多轮Prepare/Accept下只能确定一个值，而Multi Paxos则可以在降低延时的同时确定多个值并且保证其顺序，这才是Multi Paxos被广泛地工程化应用的核心原因。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"关于Multi Paxos的更多介绍可参见此wiki phxpaxos ","attrs":{}},{"type":"link","attrs":{"href":"https://link.zhihu.com/?target=https%3A//github.com/Tencent/phxpaxos/wiki","title":null,"type":null},"content":[{"type":"text","text":"https://github.com/Tencent/phxpaxos/wiki","attrs":{}}]},{"type":"text","text":"。","attrs":{}}]},{"type":"embedcomp","attrs":{"type":"table","data":{"content":"

Tip	扩展阅读Paxos经典论文 https://www.microsoft.com/en-us/research/publication/paxos-made-simple/Wikipedia Paxos https://en.wikipedia.org/wiki/Paxos_(computer_science)知行学社——paxos和分布式系统 https://www.bilibili.com/video/av36134550/

"}}},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"总结","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"本文我们通过图示及伪代码讲解了经典的 Paxos 算法实现原理，一致性算法还有很多，比如","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"Raft","attrs":{}}],"attrs":{}},{"type":"text","text":"、","attrs":{}},{"type":"codeinline","content":[{"type":"text","text":"Zab","attrs":{}}],"attrs":{}},{"type":"text","text":"，不同算法间实现的逻辑有很多的共通性，可以举一反三，如有必要笔者也会持续更新相关的内容。","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"关注我的公众号：","attrs":{}}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"link","attrs":{"href":"https://link.zhihu.com/?target=https%3A//mp.weixin.qq.com/s/eiiirZaJ6tl5YsyngfvuuA","title":null,"type":null},"content":[{"type":"text","text":"分布式架构的根基：深入浅出一致性算法mp.weixin.qq.com","attrs":{}}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}}]}

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

相關文章

画像系统人群服务数据存储架构的演进与创新| 京东云技术团队

一、畫像系統命中接口相關簡介什麼是畫像系統標籤畫像系統是一種數據管理和分析工具，它通過整合和分析用戶的行爲數據、交易數據、社交數據等多維度信息，構建出用戶的詳細畫像，幫助咱們運營人員更好地理解目標用戶羣體，從而實現精準營銷和精細

2024-05-14 23:57:28

探索大语言模型：理解Self Attention| 京东物流技术团队

一、背景知識在ChatGPT引發全球關注之後，學習和運用大型語言模型迅速成爲了熱門趨勢。作爲程序員，我們不僅要理解其表象，更要探究其背後的原理。究竟是什麼使得ChatGPT能夠實現如此卓越的問答性能？自注意力機制的巧妙融入無疑是關鍵因素

2024-05-14 23:57:26

go-kit学习指南 - 基础概念和架构

原文：https://blog.fengjx.com/pages/40737e 介紹 go-kit 是一個微服務開發工具集，並不算一個完整的框架。根據工程實踐總結的一套開發規範，解決分佈式開發中的常見問題，它同樣也適用於單體服務開發。

2024-05-14 12:17:28

JDBC连接openGauss6.0和PostgreSQL16.2性能对比

本文分享自華爲雲社區《JDBC連接openGauss6.0和PostgreSQL16.2性能對比》，作者： Gauss松鼠會小助手。 PostgreSQL vs openGauss 01 前置準備安裝JDK：詳細安裝步驟請問度娘，輸

2024-05-14 11:00:08

Python函数与模块的精髓与高级特性

本文分享自華爲雲社區《Python函數與模塊的精髓與高級特性》，作者：檸檬味擁抱。 Python 是一種功能強大的編程語言，擁有豐富的函數和模塊，使得開發者能夠輕鬆地構建複雜的應用程序。本文將介紹 Python 中函數和模塊的基本使用方法，

2024-05-14 11:00:07

测试人员都是画画大神，让我看看谁还不会用代码图？

給大家30秒的時間，一起來思考這是什麼？這是某系統登陸模塊功能的初始類圖。隨着現代軟件的不斷複雜化，代碼圖（Code Graphs）爲測試人員提供了一種直觀的方法，讓複雜的代碼邏輯易於理解。本文將深入探討代碼圖，通過挖掘到的真實場景

2024-05-14 02:08:59

SharePoint Online 客制化开发：如何使用CSS更改网站主题背景颜色？

一般情況下公司爲了某個團隊或者公司內部共享數據等用途來更改網站的樣式，打造獨特的品牌樣式，很多研發工程師給定的解決方案是爲他們的站點構建自定義主頁，雖然SharePoint Designer是一個強大的工具，但這裏我不推薦使用ShareP

2024-05-14 02:00:35

我拍了拍Redis，被移出了群聊···

01 Redis的新煩惱你好，我是Redis，一個叫Antirez的男人把我帶到了這個世界上。自從上次被拉入羣聊之後，我就從一個人單打獨鬥變成了團隊合作，在小夥伴們的共同努力下，不僅有主從複製可以數據備份，還有哨兵節點負責監控管理

2024-05-14 01:06:44

Codeforces Round #698 (Div. 2)-C. Nezzar and Symmetric Array-题解

目錄 C. Nezzar and Symmetric Array 題目大意解題思路首先原數組中，一個數的差值和與這個數的相反數的差值和是相同的。這就需要 *條件一* 差值和們成對出現因±差

2024-05-14 00:37:33

数据结构笔记浅记（十四）树

二叉樹「二叉樹 binary tree」是一種非線性數據結構，代表“祖先”與“後代”之間的派生關係，體現了“一分爲二” 的分治邏輯。與鏈表類似，二叉樹的基本單元是節點，每個節點包含值、左子節點引用和右子節點引用。每個節點都有兩個引

2024-05-14 00:28:41

Netty实战九之单元测试

ChannelHandler是Netty應用程序的關鍵元素，所以徹底地測試他們應該是你的開發過程的一個標準部分。最佳實踐要求你的測試不僅要能夠證明你的實現是正確的，而且還要能夠很容易地隔離那些因修改代碼而突然出現的問題。這種類型的測試叫做

2024-05-14 00:19:17

企业IT架构治理之道

一、什麼是架構和治理 1.1 架構的起源開篇還是要說說大家理解的架構，何爲架構，架構跟我們的工作和生活有什麼關係。英文Architecture本源來自於拉丁語，最早起源於建築領域，建築是文明社會一個重要的標誌，同時也是人類社會最早

京東雲開發者

2024-05-13 23:59:32

Java Chassis 3：接口维度负载均衡

本文分享自華爲雲社區《Java Chassis 3技術解密：接口維度負載均衡》，作者： liubao68。在Java Chassis 3技術解密：負載均衡選擇器中解密了Java Chassis 3負載均衡在解決性能方面提供的算法。這次解密

2024-05-13 23:00:25

面试官：说说你对序列化的理解

本文主要內容背景在Java語言中，程序運行的時候，會產生很多對象，而對象信息也只是在程序運行的時候纔在內存中保持其狀態，一旦程序停止，內存釋放，對象也就不存在了。怎麼能讓對象永久的保存下來呢？--------對象序列化。何

2024-05-13 22:58:28

O2OA翱途开发平台前端API和后端API的访问以及使用

O2OA是一個高度可定製化的企業級開發平臺，它的API（應用程序接口）分爲前端和後端，各自有不同的用途，平臺爲用戶開放了全部的後端API供開發者使用，開發者可以根據各類API組織出符合實際業務需求的新服務或者新業務，用於數據查詢，業務接

2024-05-13 22:50:31

24小時熱門文章

最新文章

最新評論文章