18.2. `json` — JSON 编码器和解码器 ¶

2.6 版新增。

JSON (JavaScript 对象表示法) ，指定通过 RFC 4627 , is a lightweight data interchange format based on a subset of JavaScript syntax ( ECMA-262 3rd edition ).

json 暴露用户熟悉的 API 标准库 marshal and pickle 模块。

编码基本 Python 对象层次结构：

>>> import json
>>> json.dumps(['foo', {'bar': ('baz', None, 1.0, 2)}])
'["foo", {"bar": ["baz", null, 1.0, 2]}]'
>>> print json.dumps("\"foo\bar")
"\"foo\bar"
>>> print json.dumps(u'\u1234')
"\u1234"
>>> print json.dumps('\\')
"\\"
>>> print json.dumps({"c": 0, "b": 0, "a": 0}, sort_keys=True)
{"a": 0, "b": 0, "c": 0}
>>> from StringIO import StringIO
>>> io = StringIO()
>>> json.dump(['streaming API'], io)
>>> io.getvalue()
'["streaming API"]'

紧凑编码：

>>> import json
>>> json.dumps([1,2,3,{'4': 5, '6': 7}], separators=(',',':'))
'[1,2,3,{"4":5,"6":7}]'

美化打印：

>>> import json
>>> print json.dumps({'4': 5, '6': 7}, sort_keys=True,
...                  indent=4, separators=(',', ': '))
{
    "4": 5,
    "6": 7
}

解码 JSON：

>>> import json
>>> json.loads('["foo", {"bar":["baz", null, 1.0, 2]}]')
[u'foo', {u'bar': [u'baz', None, 1.0, 2]}]
>>> json.loads('"\\"foo\\bar"')
u'"foo\x08ar'
>>> from StringIO import StringIO
>>> io = StringIO('["streaming API"]')
>>> json.load(io)
[u'streaming API']

专攻 JSON 对象解码：

>>> import json
>>> def as_complex(dct):
...     if '__complex__' in dct:
...         return complex(dct['real'], dct['imag'])
...     return dct
...
>>> json.loads('{"__complex__": true, "real": 1, "imag": 2}',
...     object_hook=as_complex)
(1+2j)
>>> import decimal
>>> json.loads('1.1', parse_float=decimal.Decimal)
Decimal('1.1')

延伸 JSONEncoder :

>>> import json
>>> class ComplexEncoder(json.JSONEncoder):
...     def default(self, obj):
...         if isinstance(obj, complex):
...             return [obj.real, obj.imag]
...         # Let the base class default method raise the TypeError
...         return json.JSONEncoder.default(self, obj)
...
>>> dumps(2 + 1j, cls=ComplexEncoder)
'[2.0, 1.0]'
>>> ComplexEncoder().encode(2 + 1j)
'[2.0, 1.0]'
>>> list(ComplexEncoder().iterencode(2 + 1j))
['[', '2.0', ', ', '1.0', ']']

Using json.tool from the shell to validate and pretty-print:

$ echo '{"json":"obj"}' | python -mjson.tool
{
    "json": "obj"
}
$ echo '{1.2:3.4}' | python -mjson.tool
Expecting property name enclosed in double quotes: line 1 column 2 (char 1)

注意

JSON 是子集对于 YAML 1.2。由此模块的默认设置产生的 JSON (尤其，默认 separators 值) 还是 YAML 1.0 和 1.1 的子集。因此，此模块也可以用作 YAML 序列化器。

18.2.1. Basic Usage ¶

json. dump ( obj , fp , skipkeys=False , ensure_ascii=True , check_circular=True , allow_nan=True , cls=None , indent=None , separators=None , encoding="utf-8" , default=None , sort_keys=False , **kw ) ¶

json. dumps ( obj , skipkeys=False , ensure_ascii=True , check_circular=True , allow_nan=True , cls=None , indent=None , separators=None , encoding="utf-8" , default=None , sort_keys=False , **kw ) ¶

json. load ( fp [ , encoding [ , cls [ , object_hook [ , parse_float [ , parse_int [ , parse_constant [ , object_pairs_hook [ , **kw ] ] ] ] ] ] ] ] ) ¶

json. loads ( s [ , encoding [ , cls [ , object_hook [ , parse_float [ , parse_int [ , parse_constant [ , object_pairs_hook [ , **kw ] ] ] ] ] ] ] ] ) ¶

18.2.2. Encoders and Decoders ¶

class json. JSONDecoder ( [ encoding [ , object_hook [ , parse_float [ , parse_int [ , parse_constant [ , strict [ , object_pairs_hook ] ] ] ] ] ] ] ) ¶

JSON	Python
对象	dict
array	list
string	unicode
数字 (int)	int, long
数字 (real)	float
true	True
false	False
null	None

decode ( s ) ¶

raw_decode ( s ) ¶

class json. JSONEncoder ( [ skipkeys [ , ensure_ascii [ , check_circular [ , allow_nan [ , sort_keys [ , indent [ , separators [ , encoding [ , default ] ] ] ] ] ] ] ] ] ) ¶

Python	JSON
dict	对象
list, tuple	array
str, unicode	string
int, long, float	编号
True	true
False	false
None	null

default ( o ) ¶

encode ( o ) ¶

iterencode ( o ) ¶

18.2.3. Standard Compliance ¶

JSON 格式的指定通过 RFC 4627 . This section details this module’s level of compliance with the RFC. For simplicity, JSONEncoder and JSONDecoder subclasses, and parameters other than those explicitly mentioned, are not considered.

This module does not comply with the RFC in a strict fashion, implementing some extensions that are valid JavaScript but not valid JSON. In particular:

Top-level non-object, non-array values are accepted and output;
接受无限和 NaN (非数字) 数值并输出；
Repeated names within an object are accepted, and only the value of the last name-value pair is used.

Since the RFC permits RFC-compliant parsers to accept input texts that are not RFC-compliant, this module’s deserializer is technically RFC-compliant under default settings.

18.2.3.1. Character Encodings ¶

The RFC recommends that JSON be represented using either UTF-8, UTF-16, or UTF-32, with UTF-8 being the default. Accordingly, this module uses UTF-8 as the default for its encoding 参数。

This module’s deserializer only directly works with ASCII-compatible encodings; UTF-16, UTF-32, and other ASCII-incompatible encodings require the use of workarounds described in the documentation for the deserializer’s encoding 参数。

The RFC also non-normatively describes a limited encoding detection technique for JSON texts; this module’s deserializer does not implement this or any other kind of encoding detection.

As permitted, though not required, by the RFC, this module’s serializer sets ensure_ascii=True by default, thus escaping the output so that the resulting strings only contain ASCII characters.

18.2.3.2. Top-level Non-Object, Non-Array Values ¶

The RFC specifies that the top-level value of a JSON text must be either a JSON object or array (Python dict or list ). This module’s deserializer also accepts input texts consisting solely of a JSON null, boolean, number, or string value:

>>> just_a_json_string = '"spam and eggs"'  # Not by itself a valid JSON text
>>> json.loads(just_a_json_string)
u'spam and eggs'

This module itself does not include a way to request that such input texts be regarded as illegal. Likewise, this module’s serializer also accepts single Python None , bool , numeric, and str values as input and will generate output texts consisting solely of a top-level JSON null, boolean, number, or string value without raising an exception:

>>> neither_a_list_nor_a_dict = u"spam and eggs"
>>> json.dumps(neither_a_list_nor_a_dict)  # The result is not a valid JSON text
'"spam and eggs"'

This module’s serializer does not itself include a way to enforce the aforementioned constraint.

18.2.3.3. Infinite and NaN Number Values ¶

The RFC does not permit the representation of infinite or NaN number values. Despite that, by default, this module accepts and outputs Infinity , -Infinity ，和 NaN as if they were valid JSON number literal values:

>>> # Neither of these calls raises an exception, but the results are not valid JSON
>>> json.dumps(float('-inf'))
'-Infinity'
>>> json.dumps(float('nan'))
'NaN'
>>> # Same when deserializing
>>> json.loads('-Infinity')
-inf
>>> json.loads('NaN')
nan

在序列化器中， allow_nan 参数可用于更改此行为。在反序列化器中， parse_constant 参数可用于更改此行为。

18.2.3.4. Repeated Names Within an Object ¶

The RFC specifies that the names within a JSON object should be unique, but does not specify how repeated names in JSON objects should be handled. By default, this module does not raise an exception; instead, it ignores all but the last name-value pair for a given name:

>>> weird_json = '{"x": 1, "x": 2, "x": 3}'
>>> json.loads(weird_json)
{u'x': 3}

The object_pairs_hook 参数可用于更改此行为。

18.2. `json` — JSON 编码器和解码器 ¶

18.2.1. Basic Usage ¶

18.2.2. Encoders and Decoders ¶

18.2.3. Standard Compliance ¶

18.2.3.1. Character Encodings ¶

18.2.3.2. Top-level Non-Object, Non-Array Values ¶

18.2.3.3. Infinite and NaN Number Values ¶

18.2.3.4. Repeated Names Within an Object ¶

内容表

上一话题

下一话题

本页

快速搜索

18.2. json — JSON 编码器和解码器 ¶

18.2.1. Basic Usage ¶

18.2.2. Encoders and Decoders ¶

18.2.3. Standard Compliance ¶

18.2.3.1. Character Encodings ¶

18.2.3.2. Top-level Non-Object, Non-Array Values ¶

18.2.3.3. Infinite and NaN Number Values ¶

18.2.3.4. Repeated Names Within an Object ¶

内容表

上一话题

下一话题

本页

快速搜索

18.2. `json` — JSON 编码器和解码器 ¶