python 中序列化方式的比较（pickle, json）

相同点，都能够用来序列化数据，返回字符串。
下面主要是来说明一下区别

性能

通过简单测试看到序列化相同数据，json速度显然要快一些，大约是20~30倍，同时得到的字符串的体积上也有不同
json 相对较小，数据结构不同，差别也有不同。

总结来说，json 性能较好（对于另外的类型还需验证）如果必要使用pickle ，可以使用更快的cPickle。

适用性

json 适合跨语言的数据交换，pickle 是适用于python的数据的序列化.

区别就是json适用的数据只是一些基本类型，pickle 可以将python 里面的类型都给序列化（包括函数，类，实例）。

所以就是在使用的时候，涉及到跨语言或是简单类型的数据交换 使用json。对于python 语言之间，类型复杂的数据交换则用pickle。

下面是测试的代码和 pickle 序列化数据的时候的关键内容：

下面主要是序列化了10万列表类型的数据和字典类型的数据

# coding=utf8
# Time    : 2020/1/15 14:17
# File    : json_pickle.py
# Software: PyCharm
import json
import pickle
import time
from contextlib import wraps


def tt(func):
    @wraps(func)
    def wrapper(*args, **kwargs):
        _s = time.time()
        value = func(*args, **kwargs)
        _e = time.time()
        print _e - _s, len(value)
        return value

    return wrapper


@tt
def test_json(objs):
    ts = json.dumps(objs)
    return ts


@tt
def test_pickle(objs):
    ts = pickle.dumps(objs)
    return ts


def test(x):
    test_json(x)
    test_pickle(x)
    print "--" * 30


if __name__ == '__main__':
    l = 100000
    x = [i for i in range(l)]
    print "list:"
    test(x)

    d = dict([(str(i), i) for i in range(l)])
    print "dict:"
    test(d)

下面是代码来自pickle源码:

# pickle.py

class Pickler:
    def save_dict(self, obj):
        write = self.write

        if self.bin:
            write(EMPTY_DICT)
        else:   # proto 0 -- can't use EMPTY_DICT
            write(MARK + DICT)

        self.memoize(obj)
        self._batch_setitems(obj.iteritems())

    dispatch[DictionaryType] = save_dict
    if not PyStringMap is None:
        dispatch[PyStringMap] = save_dict

    def _batch_setitems(self, items):
        # Helper to batch up SETITEMS sequences; proto >= 1 only
        pass

    def save_inst(self, obj):
        pass

    dispatch[InstanceType] = save_inst

    def save_global(self, obj, name=None, pack=struct.pack):
        """代码省略可以去源码查看"""
        pass

    dispatch[ClassType] = save_global
    dispatch[FunctionType] = save_global
    dispatch[BuiltinFunctionType] = save_global
    dispatch[TypeType] = save_global

hpulfc

发布了131 篇原创文章 · 获赞 153 · 访问量 34万+

私信关注

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

python 中序列化方式的比较

python 中序列化方式的比较（pickle, json）

性能

适用性

vue绑定对象，绑定的值不改变的问题

Spring Cloud 部署时如何使用 Kubernetes 作为注册中心和配置中心

KubeKey 部署 K8s v1.28.8 实战

记一些CISP-PTE题目解析

如何使用tensor2tensor部署一個預測任務

docker 之安裝卸載、mysql、redis、mongo、服務的運行

python 常用庫整理（yml、ujson、hashlib、base64、cgi、markdown篇）

線程同步中的一些鎖、條件、信號量的解析

python 中判斷一個對象是否爲函數

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結