01-Python 中的数据类型-02-字符串类型

总体要讲的大纲内容如下

数字类型- int float complex
字符串类型 Text Sequence Type- str
序列类型 - list tuple range
集合类型 - set frozenset
上下文管理器类型 - 比较复杂暂时不用掌握
文本序列类型
二进制序列类型 bytes bytesarray memoryview
真值检测
比较运算符
逻辑运算符
如何判断一个对象是什么类型- type(xxx)

今天继续讲基础类型的字符串类型，字符串类型是比较常见的。

生活中东西的名称，苹果对应就是 Apple ，橘子 orange

字符串的定义

在Python中字符串是如何定义的呢？

一般来说有三种方式定义一个字符串。单引号，双引号，和三单引号，三双引号

注意这里单引号，双引号，都是英文字符，不是中文字符。

name = 'frank'

name2 = "frank2"

name3 = '''frank'''

name4 = """frank4"""

print(name, type(name))
print(name2, type(name2))
print(name3, type(name3))
print(name4, type(name4))

这种几种方式几乎没有区别。有时候可以替换。

假设你的字符串有' 这个时候，就可以用双引号。

>>> sentence="Frank's book is here."
>>> sentence
"Frank's book is here."

>>> type(sentence)
<class 'str'>

还有你的字符串比较长，一行很难放下,有多行的时候。

这个时候可以用三引号

sentence="""The debugger caught an exception in your WSGI application.
You can now look at the traceback which led to the error.
To switch between the interactive traceback and the plaintext one, 
you can click on the "Traceback" headline. 
From the text traceback you can also create a paste of it. 
For code execution mouse-over the frame you want to debug and click on the console icon on the right side."""

>>> sentence="""The debugger caught an exception in your WSGI application.
... You can now look at the traceback which led to the error.
... To switch between the interactive traceback and the plaintext one, 
... you can click on the "Traceback" headline. 
... From the text traceback you can also create a paste of it. 
... For code execution mouse-over the frame you want to debug and click on the console icon on the right side."""
>>>

字符串常见的方法（操作）

capitalize ,title

str.capitalize() 这个方法返回一个字符串，这个字符串相对于str 开头第一个字符大写

str.title() 每个单词的首字母大写

看下面例子

>>> s = 'frank  aaa  bbb'
>>> s.capitalize()
'Frank  aaa  bbb'
>>> s.title()
'Frank  Aaa  Bbb'

encoding

str.encoding() 字符串可以进行编码，默认utf-8 编码，简单理解字符串可以转换不同的编码格式。utf-8用的比较多.

>>> name = 'frank'
>>> 
>>> name.encode(encoding="utf-8")
b'frank'
>>> name2= name.encode(encoding="utf-8")
>>> type(name2)
<class 'bytes'>

这里发现经过 str.encode 把一个str 类型转换成了一个 btyes 类型。bytes 类型也是python的基础数据类型，之后会说到。

现在要记得 str 如何转成bytes 类型使用 encode 这个方法。

位置概念

补充一点：

str 这个数据类型可以有"位置" 的概念 ,默认第一个位置是从0 开始的。

>>> name='frank'
>>> name[0]
'f'
>>> name[1]
'r'
>>> name[4]
'k'
>>> name[3]
'n'

从上面可以看出第0 号位置是’f’ , 第四号位置是 ‘k’ .

注意对应 frank 这个字符串，最大的位置是4，能不能取5呢？肯定不能，因为位置5 已经超出了 frank 的范围。看下面的例子

>>> name[5]
Traceback (most recent call last):
  File "<input>", line 1, in <module>
IndexError: string index out of range

有的时候我并不获取一个字符，而是获取一段， python中一个东西叫切片使用方法 str[a:b] , a<=x<b 这个范围

>>> sentence
'The debugger caught an exception in your WSGI application.'

>>> sentence[0:3]
'The'
>>> sentence[4:12]
'debugger'

find,rfind

str.find() 寻找一个字符串是否在 str 中，如果在返回对应最小的位置的下标(位置)，如果没有找到返回 -1

>>> sentence="""The debugger caught an exception in your WSGI application."""
>>> sentence
'The debugger caught an exception in your WSGI application.'
>>> sentence.find('an')
20
>>> sentence.find('the')
-1
>>> sentence.find('Frank')
-1
>>> sentence.find('T')
0
>>> sentence.find('debug')
4

str.rfind() 寻找一个字符串是否在 str 中，如果在返回对应最大的位置的下标(位置)，如果没有找到返回 -1

>>> # 再来看一个例子
>>> sentence ='I like swimming and I like sports'
>>> sentence.find('like')
2
>>> sentence[2]
'l'
>>> sentence.rfind('like')
22
>>> sentence[2:6]
'like'
>>> sentence[22:26]
'like'

index,rindex

str.index 这个方法和find 相似，唯一的区别是当寻找的子串没有找到的时候会报错，抛出 ValueError的异常。

>>> sentence
'The debugger caught an exception in your WSGI application.'
>>> sentence.index('an')
20
>>> sentence.index('in')
33
>>> sentence.index('on')
30
>>> sentence.index('over')
Traceback (most recent call last):
  File "<input>", line 1, in <module>
ValueError: substring not found

str.rindex() 和 str.find() 方法类似，唯一的区别是没有找到的情况下，也会抛出一个 ValueError的异常

startswith ,endswith

str.startswith(prefix) 判断字符串是否是prefix开头，返回 True ,False

str.endswith(suffix) 判断字符串是否是suffx 结尾返回 True ,False

>>> s ='apple  aaa bbb ccc  dd'
>>> s.startswith('a')
True
>>> s.startswith('app')
True
>>> s.startswith('apps')
False
>>> s.endswith('dd')
True
>>> s.endswith('d')
True
>>> s.endswith('df')
False

len 获取字符串长度

len(s) 获取字符串的长度

>>> s
'apple  aaa bbb ccc  dd'
>>> len(s)
22

replace

str.replace(old, new[, count]) 返回一个将old 替换为new 的一个字符串，如果 count 给定了值，只替换 count 次。如果没有给定count 就是全部替换。

>>> # 把 A 替换成 a 
>>> string ='aaaaAaaaAaaaAAa'
>>> string.replace('A','a')
'aaaaaaaaaaaaaaa'
>>> # 最多替换2次
>>> string.replace('A','a',2)
'aaaaaaaaaaaaAAa'

strip,lstrip,rstrip

有时候一个字符串没有那么规整，比如前后都有空格，你想去掉前面，或者的空格保留中间的部分 .

str.lstrip() 去掉左边空格，返回一个字符串

str.rstrip() 去掉右边空格，返回一个字符串

str.strip() 去掉两边边空格，返回一个字符串

>>> s ='       aaa bbb ccc    '
>>> s
'       aaa bbb ccc    '
>>> s.lstrip()
'aaa bbb ccc    '
>>> s
'       aaa bbb ccc    '
>>> s.rstrip()
'       aaa bbb ccc'
>>> s.strip()
'aaa bbb ccc'

upper,lower

大小写转换

str.upper() 转换为大写,返回一个字符串

str.lower() 转换为小写，返回一个字符串

>>> 'abc'.upper()
'ABC'
>>> 
>>> 'Frank'.upper()
'FRANK'


>>> 'FRANK'.lower()
'frank'
>>> 'FrAnk'.lower()
'frank'

str.islower()

str.isupper()

判断是不是全是大写或者小写，返回布尔值True, False

>>> 'frank'.islower()
True
>>> 'frank'.isupper()
False
>>> 'FRANK'.isupper()
True

swapcase

str.swapcase() 这个方法有点意思，就是改变字符的大小写。

原本大写字符 -> 小写字符

原本小写字符变成大写字符

>>> 'aaBBccDD'.swapcase()
'AAbbCCdd'

`+` 连接字符串

连接两个字符串用 +

>>> name ='frank'
>>> hobby = 'swimming'
>>> verb = 'likes'
>>> name + verb + hobby
'franklikesswimming'
>>> # 好丑，重新连接
>>> name +' '+ verb+' ' + hobby +'.'
'frank likes swimming.'

join

str.join(iterable) 返回一个字符串，该字符串是可迭代的字符串的串联,有点抽象，举个例子

>>> hello ='hello'
>>> world='world'


>>> ','.join(world)
'w,o,r,l,d'
>>> hello.join(world)
'whelloohellorhellolhellod'

比如用, join 一个字符串，就是用逗号将world 每一个字符连接起来。

hello.join (world) 就是用 hello 把 world 每一个字符连接起来

‘whelloohellorhellolhellod’ 就是下面的样子。

比如用加号把 hello 连接起来

>>> '+'.join(hello)
'h+e+l+l+o'

count

str.count(sub [, start[, end]]) 在 [start,end ] 范围内寻找没有子串sub ,如果有的话，出现的次数。如果没有返回0 ， start ,end 如果不指定的话，默认搜索整个字符串的范围 ,即 start=0,end =len(str)-1

注意： str 索引是从0 开始的。

>>> sentence ="hello hello world world hello hello"
>>> sentence.count('hello')
4
>>> sentence.count('aaa')
0


>>> sentence[0:17]
'hello hello world'
>>> sentence.count('hello',0,17)
2

split ,splitlines

str.split()

str.split(sep=None, maxsplit=-1) 分开的意思拆分字符串,

参数sep 就是要分隔字符的标识，maxsplit 拆分的次数，默认值是-1，就是尽可能大的次数拆分这个字符串。

>>> #用逗号拆分
>>> 'one,two,three,four'.split(',')
['one', 'two', 'three', 'four']
>>> nums='one,two,three,four'.split(',')
>>> nums
['one', 'two', 'three', 'four']
>>> type(nums)
<class 'list'>



>>> 'one,two,three,four'.split(',',maxsplit=2)
['one', 'two', 'three,four']

首先就是用逗号拆分这个字符串，发现这个字符串全部通过逗号拆开了，并且这些值放到了[ ] 里面，

通过 type 查看这个类型发现是 list, 现在你又发现了一种数据类型叫list。它可以保存一系列的数据。

maxsplit 设置切分次数。上面的例子设置2 ，那么之后的字符串就单独放在一个一起了。

splitlines 这个是以 ‘\n’ 作为换行符，并且返回一个list ,但是如果这个字符以最后一个\n 结尾，

两个方法稍微有点区别，如果使用split(’\n’)会被拆成两条数据。而 splitlines 只会是一条数据，这就是有点区别的地方。

>>> 'aaa\nbbb\nccc\nddd'.splitlines()
['aaa', 'bbb', 'ccc', 'ddd']

>>> 'aaa\nbbb\nccc\nddd'.split('\n')
['aaa', 'bbb', 'ccc', 'ddd']


>>> 'one line\n'.split('\n')
['one line', '']
>>> 'one line\n'.splitlines()
['one line']

isxxxx

isxxx 系列判断是不是某些特殊的值，返回 True ,False

这些平常用到不是特别多，但是用到的时候，只要去查一些文档就好了。

str.isascii() 是不是ASCII 吗？
str.isalnum() 所有的字符是否都是数字
str.isalpha() 所有的字符是不是都是字母
str.isdecimal() 所有的字符是不是都是小数的字符
str.isidentifier() 所有的字符都是标识符
str.isspace() 所有的字符是不是都是空格
str.isprintable() 所有的字符是不是都是可以打印的

name = '1.343'
name.isascii()
name.isalnum()
name.isalpha()
name.isdecimal()
name.isidentifier()
name.isspace()
name.isprintable()

总结

今天主要讲了字符串的表示，以及常用方法，查找，拼接，替换，统计，大小写转换等。

这里可能你不能把所有的方法都能记住, 但是用到的时候你知道如何查文档就可以了。还有今天接触了两种数据类型，一种是bytes 类型，一种是 list 类型，还记得他们是如何得到的吗？如果忘记了，赶紧翻上去，看看哦！加油！

参考文档

doc str
identifiers
count
methods

分享快乐,留住感动. 2020-03-18 20:29:39 --frank

01-Python 中的数据类型-02-字符串类型

文章目录

字符串的定义

字符串常见的方法（操作）

capitalize ,title

encoding

位置概念

find,rfind

index,rindex

startswith ,endswith

len 获取字符串长度

replace

strip,lstrip,rstrip

upper,lower

swapcase

`+` 连接字符串

join

count

split ,splitlines

isxxxx

总结

参考文档

测试人员都是画画大神，让我看看谁还不会用代码图？

Object.values()对象遍历

01-Python 中的數據類型-01-數字類型

00-陪你一起學python系列

python3 如何獲取一個文件的目錄,獲取上一級目錄

python3中的特性property介紹

02-python 基礎語法知識-03-內置函數

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結

01-Python 中的数据类型-02-字符串类型

文章目录

字符串的定义

字符串 常见的方法（操作）

capitalize ,title

encoding

位置概念

find,rfind

index,rindex

startswith ,endswith

len 获取字符串长度

replace

strip,lstrip,rstrip

upper,lower

swapcase

+ 连接字符串

join

count

split ,splitlines

isxxxx

总结

参考文档

字符串常见的方法（操作）

`+` 连接字符串