20.16. `urlparse` — 将 URL 剖析成组件 ¶

注意

The urlparse module is renamed to urllib.parse in Python 3. The 2to3 tool will automatically adapt imports when converting your sources to Python 3.

源代码： Lib/urlparse.py

此模块定义的标准接口能将 URL (统一资源定位符) 字符串分解成组件 (编址方案、网络位置、路径等)，将组件组合回 URL 字符串，及将给定基 URL 的相对 URL 转换成绝对 URL。

模块被设计成匹配互联网 RFC 相对 URL (统一资源定位符)。它支持下列 URL 方案： file , ftp , gopher , hdl , http , https , imap , mailto , mms , news , nntp , prospero , rsync , rtsp , rtspu , sftp , shttp , sip , sips , snews , svn , svn+ssh , telnet , wais .

New in version 2.5: 支持 sftp and sips schemes.

The urlparse 模块定义了下列函数：

urlparse. urlparse ( urlstring [ , scheme [ , allow_fragments ] ] ) ¶

属性	索引	值	值若不存在
`scheme`	0	URL 方案说明符	空字符串
`netloc`	1	网络位置部分	空字符串
`path`	2	分层路径	空字符串
`params`	3	Parameters for last path element	空字符串
`query`	4	查询组件	空字符串
`fragment`	5	片段标识符	空字符串
`username`		用户名	`None`
`password`		口令	`None`
`hostname`		主机名 (小写)	`None`
`port`		Port number as integer, if present	`None`

urlparse. parse_qs ( qs [ , keep_blank_values [ , strict_parsing ] ] ) ¶

urlparse. parse_qsl ( qs [ , keep_blank_values [ , strict_parsing ] ] ) ¶

urlparse. urlunparse ( parts ) ¶

urlparse. urlsplit ( urlstring [ , scheme [ , allow_fragments ] ] ) ¶

属性	索引	值	值若不存在
`scheme`	0	URL 方案说明符	空字符串
`netloc`	1	网络位置部分	空字符串
`path`	2	分层路径	空字符串
`query`	3	查询组件	空字符串
`fragment`	4	片段标识符	空字符串
`username`		用户名	`None`
`password`		口令	`None`
`hostname`		主机名 (小写)	`None`
`port`		Port number as integer, if present	`None`

urlparse. urlunsplit ( parts ) ¶

urlparse. urljoin ( base , url [ , allow_fragments ] ) ¶

urlparse. urldefrag ( url ) ¶

另请参阅

RFC 3986 - 统一资源标识符: This is the current standard (STD66). Any changes to urlparse module should conform to this. Certain deviations could be observed, which are mostly due backward compatiblity purposes and for certain de-facto parsing requirements as commonly observed in major browsers.
RFC 2732 - Format for Literal IPv6 Addresses in URL’s.: This specifies the parsing requirements of IPv6 URLs.
RFC 2396 - Uniform Resource Identifiers (URI): Generic Syntax: Document describing the generic syntactic requirements for both Uniform Resource Names (URNs) and Uniform Resource Locators (URLs).
RFC 2368 - The mailto URL scheme.: Parsing requirements for mailto url schemes.
RFC 1808 - Relative Uniform Resource Locators: This Request For Comments includes the rules for joining an absolute and a relative URL, including a fair number of “Abnormal Examples” which govern the treatment of border cases.
RFC 1738 - Uniform Resource Locators (URL): This specifies the formal syntax and semantics of absolute URLs.

20.16.1. Results of `urlparse()` and `urlsplit()` ¶

结果对象来自 urlparse() and urlsplit() 函数是子类化的 tuple type. These subclasses add the attributes described in those functions, as well as provide an additional method:

ParseResult. geturl ( ) ¶

The following classes provide the implementations of the parse results:

class urlparse. BaseResult ¶

class urlparse. ParseResult ( scheme , netloc , path , params , query , fragment ) ¶

class urlparse. SplitResult ( scheme , netloc , path , query , fragment ) ¶

20.16. `urlparse` — 将 URL 剖析成组件 ¶

20.16.1. Results of `urlparse()` and `urlsplit()` ¶

内容表

上一话题

下一话题

本页

快速搜索

20.16. urlparse — 将 URL 剖析成组件 ¶

20.16.1. Results of urlparse() and urlsplit() ¶

内容表

上一话题

下一话题

本页

快速搜索

20.16. `urlparse` — 将 URL 剖析成组件 ¶

20.16.1. Results of `urlparse()` and `urlsplit()` ¶