Python笔记（自用）

hhhhhhhhhhhh该笔记持续更新，仅更新自用部分，如有错误请在评论区指正，谢谢🙏

首次博客更新2025-12-29

类型注解#

给变量加类型注解

1
name: str = "Tom"
2
age: int = 18
3
price: float = 3.14
4
flag: bool = True

列表，字典：

1
nums: list[int] = [1, 2, 3]
2
names: list[str] = ["a", "b"]
3

4
info: dict[str, int] = {"Tom": 18}

返回值注解

1
def add(a: int, b: int) -> int:
2
    return a + b

Literal注解

1
from typing import Literal
2
status: Literal["success", "fail", "pending"]

Optional注解

1
from typing import Optional
2

3
def get_name() -> Optional[str]:
4
    return None

Any

1
from typing import Any
2

3
data: Any = 123
4
data = "abc"
5
data = []

Callable

1
Callable[[参数类型列表], 返回类型]

示例：

1
from typing import Callable
2

3
def apply(f: Callable[[int, int], int], a: int, b: int) -> int:
4
    return f(a, b)

虚拟环境#

1
python -m venv .venv

基本语法#

input返回str类型

1
name = input("请输入你的名字：")
2
print("你好，" + name)

类型转换

1
int("666")

1
print("知道了，你今年" + str(2) + "岁了！")

条件控制#

1
mood_index = int(input("Enter mood index: "))
2
if mood_index >= 60:
3
    print("congratulation!")
4
elif 30 <mood_index < 60:
5
    print("Nromal.")
6
else:
7
    print("Sorry.")

布尔类型#

唯一实例#

唯一实例：True 和 False 是全局唯一的单例对象

1
a = True
2
b = True
3
print(a is b)   # True
4

5
x = False
6
y = False
7
print(x is y)   # True

不可变#

一旦创建，不能修改它的值：

1
a = True
2
# a[0] = False    # ❌ TypeError

切片#

用法：序列[start : stop : step]

含义：从 start 开始，到 end 结束（不包含 end），步长为 step

step为负数（反向切片）：

1
a = [0,1,2,3,4]
2

3
print(a[::-1])

1
[4,3,2,1,0]

列表#

1
shopping_list = ["apple", "banana", "cherry"]
2
shopping_list.append("orange")
3
print(shopping_list[0])
4

5
price = [100,200,301,129]
6
max_price = max(price)
7
min_price = min(price)
8
sorted_price = sorted(price)
9
print(max_price)
10
print(min_price)
11
print(sorted_price)

方法：

对象.方法名(…)

函数：

函数名.(对象)

列表的基操方法：

append()——末尾添加一个元素
extend()——添加多个元素（参数必须是可迭代对象）
insert(a,b)——指定位置插入

1
a = [1, 2, 3]
2
a.insert(1, 100)
3
print(a)    # [1, 100, 2, 3]

插入会导致后续元素整体右移，效率较低。

remove()——按值删除
pop()——按下标删除并返回（不写参数默认删除最后一个：）

1
a = [1, 2, 3]
2
x = a.pop(1)
3
print(x)    # 2
4
print(a)    # [1, 3]

clear()——清空列表
reverse()—— 逆向输出
copy()—— 列表的拷贝
a.sort(reverse=True)—— （默认升序）列表排列

函数：

sorted(words, key=len, reverse=False)——返回新列表的排序
enumerate(可迭代对象, start=0)

示例：

1
lst = ['a', 'b', 'c']
2

3
for i, v in enumerate(lst):
4
    print(i, v)

输出：

列表生成式#

[ 表达式 for 变量 in 可迭代对象 ]

`map`函数#

1
map(function, iterable)

把一个函数“批量作用”到可迭代对象的每个元素上，map返回的是迭代器

例：

1
nums = [1, 2, 3, 4]
2

3
res = map(lambda x: x * 2, nums)
4
print(list(res))

相当于

1
[x * 2 for x in nums]

元组#

在 Python 里，用逗号分隔的多个值会被自动打包成一个元组，即使没有外面的圆括号。

使用内置函数创建元组：tuple()

遍历输出元组元素：

1
print(len(t))
2
for i in range(len(t)):
3
    print(i,t[i])

删除元组：del 元组名

字典#

1
contacts = {
2
    ("xioamin",23):"111111111",
3
    ("xiaohua",24):"222222222"
4
}
5
del contacts[("xioamin",23)]
6
contacts[("xiaomei",24)] = "123131"
7
print(contacts)
8
print(len(contacts))

访问字典：
1. 使用[]访问

1
student_grades = {
2
    "语文": 90,
3
    "数学": 95,
4
    "英语": 88,
5
    "科学": 92
6
}
7
# 访问 '数学' 键对应的值
8
math_score = student_grades["数学"]
9
print(math_score)  # 输出: 95

1
2. 使用`.get(key, default=None)` 方法  （`default`为键不存在时返回的默认值）

items()方法：**无参数，**返回返回字典中所有键值对组成的可迭代视图，每个元素是 (key, value) 元组，可以用 list() 转换成列表
字典的遍历：拆开键和值

1
for k, v in d.items():
2
    print(k, v)

pop() 用于 删除字典中的某个键，并返回对应的值。
popitem()——随机删除
clear()——清空字典中所有元素
dict.update([other]) other：可选参数，可以是另一个字典、可迭代的键值对，或关键字参数。返回 None，原地修改调用字典。

字典生成式#

d={key:value for item in range}
d-{key:value for key,value in zip(lst1,lst2)}

集合#

集合（set）是一种“无序、不重复”的数据结构，用来存储一组唯一的元素。 (不能存储字典和列表）

创建方式：

使用花括号{}
使用set()函数——s=set(可迭代对象)

删除：del 集合对象

集合操作符：#

操作符	含义	示例	结果
`	`	并集	`{1,2}
`&`	交集	`{1,2} & {2,3}`	`{2}`
`-`	差集	`{1,2} - {2,3}`	`{1}`
`^`	交集的补集	`{1,2} ^ {2,3}`	`{1,3}`
`<=`	子集	`{1} <= {1,2}`	`True`
`<`	真子集	`{1} < {1,2}`	`True`
`>=`	超集	`{1,2} >= {1}`	`True`
`>`	真超集	`{1,2} > {1}`	`True`

集合操作方法：#

s.add(x)——如果x不在s集合中，添加到s集合中

s.remove(x)——将集合s中x删除

s.clear()——清空集合中所有元素

结构模型匹配#

1
match data:
2
    case pattern1:
3
        ...
4
    case pattern2:
5
        ...

1
match data:
2
    case {}:
3
        print("空字典")
4
    case []:
5
        print("空列表")
6
    case ():
7
        print("空元组")
8
    case _:
9
        print("其他情况")

多值匹配:

1
match data1, data2:
2
    case 1, 2:
3
        print("data1=1, data2=2")

异常处理#

1
try:
2
    xxx() #自定义函数
3

4
except Exception as e:  #不写默认为Exception
5
    print("输入错误")
6
    raise #层层向上报错

finally 中抛出的异常会覆盖 try 中的异常

except 后接异常类型

1
BaseException
2
 ├── SystemExit
3
 ├── KeyboardInterrupt
4
 └── Exception
5
       ├── TypeError #类型不对
6
       ├── ValueError #值不合法
7
       ├── IndexError #下标越界
8
       ├── KeyError #字典没这个键
9
       └── FileNotFoundError #文件不存在

** 同时捕获多个异常** :

1
try:
2
    x = int(input())
3
except (ValueError, TypeError):
4
    print("输入不合法")

用元组
只要命中其中一个就进 except

1
try:
2
    代码
3
except 异常类型:
4
    处理
5
else:
6
    没有异常执行
7
finally:
8
    一定执行

字符串#

方法#

大小写转换

1
s = "PyThOn"
2

3
s.lower()      # 'python'
4
s.upper()      # 'PYTHON'
5
s.title()      # 'Python'
6
s.capitalize() # 'Python'

去除空白

1
s = "  hello  "
2

3
s.strip()     # 'hello'
4
s.lstrip()    # 'hello  '
5
s.rstrip()    # '  hello'

split拆分字符串 **字符串 → 列表 **

1
s = "1001-苹果-10"
2

3
parts = s.split("-")
4
print(parts)

join进行拼接

1
"分隔符".join(可迭代对象)

示例：

1
words = ["Python", "is", "fun"]
2
result = " ".join(words)  # 空格分隔
3
print(result)

find()检索字符首次出现的位置（从0开始），不存在返回-1
index()检索字符首次出现的位置（从0开始），不存在则报错
count()
replace(_old,_new,_count)方法，将old字符串替换成new，替换次数为count（默认全部替换）
str.center(width[, fillchar])方法，把字符串放到指定宽度中间，不够的地方用指定字符填充。
eval函数去掉最外层的引号，可变为整型
repr(obj)函数，和 str() 不同，repr()会把“看不见的字符”显示出来

格式化字符串#

%格式

1
name = "Alice"
2
age = 20
3
height = 1.68
4

5
# 字符串和整数
6
print("我叫%s，今年%d岁" % (name, age))
7

8
# 浮点数保留两位小数
9
print("我的身高是%.2f米" % height)

str.format()

1
name = "Alice"
2
age = 20
3
height = 1.6789
4

5
# 按顺序填充
6
print("我叫{}，今年{}岁".format(name, age))
7

8
# 使用关键字
9
print("我叫{name}，身高{h:.2f}米".format(name=name, h=height))
10

11
# 指定位置
12
print("名字:{0}, 年龄:{1}, 再次打印名字:{0}".format(name, age))

1
s="helloworld"
2
print('{0:*<20}'.format(s)) #左对齐
3
print('{0:*>20}'.format(s)) #右对齐
4
print('{0:*^20}'.format(s)) #居中对齐
5

6
#print(s.center(20,'*'))

f-string

字符串的编码和解码#

编码 encode：str → bytes，解码 decode：bytes → str

1
s='你好世界'
2
scode=s.encode(encoding='utf-8')
3
scode2=scode.decode(encoding='utf-8')
4
print(scode)
5
print(scode2)

数据的验证#

方法	作用
`.isdigit()`	判断字符串是否只包含数字
`.isalpha()`	判断是否只包含字母
`.isalnum()`	判断是否只包含字母和数字
`.isspace()`	判断是否只包含空格
`.startswith()` / `.endswith()`	判断开头或结尾

`enumerate()`函数#

1
enumerate(iterable, start=0)

iterable：可迭代对象（列表、字符串、元组等）
start：索引起始值，默认是 0
eg：将[a, b, c]变成(0, a), (1, b), (2, c)

深拷贝和浅拷贝#

浅拷贝：创建一个容器，容器里的元素地址不变

1
import copy
2

3
lst1 = [1, 2, [3, 4]]
4

5
lst2 = lst1.copy()        # 方法一
6
lst3 = lst1[:]            # 方法二
7
lst4 = copy.copy(lst1)    # 方法三

深拷贝：创建一个**完全独立的新对象，所有层级都复制 **

1
import copy
2

3
lst1 = [1, 2, [3, 4]]
4
lst2 = copy.deepcopy(lst1)

推导式#

推导式（Comprehension） 是一种用更简洁、更优雅的语法来生成列表、字典、集合或生成器的方式。

推导式 = 简洁版的 for 循环 + 条件判断，用来构建新的数据结构。

例如：

1
nums= [1,2,3,4,5,6]
2
my_list = [i for i in nums]

第一个i表示为需要添加到新列表中的元素，改为i**2则为i的平方添加到my_list

函数#

参数类型（很重要也很实用）#

普通参数

1
def f(a, b):
2
    ...

默认参数

1
def greet(name="Nanzhi"):
2
    print("Hello", name)

匿名函数#

lambda arguments: expression

示例：

1
x = lambda a : a + 10
2
print(x(5))

可变参数（不定长）
args — 接收多个位置参数

1
def sum_all(*nums):
2
    print(nums)

nums 是一个元组。

kwargs — 接收多个键值对参数

1
def info(**data):
2
    print(data)

data 是一个字典。

1
lambda 参数1, 参数2, ...: 表达式

1
f = lambda x: x + 1
2
print(f(5))  # 6

局部变量和全局变量

1
x = 10  # 全局变量
2

3
def foo():
4
    y = 5  # 局部变量

函数内部不能直接修改全局变量，除非用 global：

1
def foo():
2
    global x
3
    x = 20

函数也是“第一类对象”（很高级但很重要）

Python 中函数可以：

赋值给变量
当作参数传给函数
放进列表、字典
作为返回值返回

例子：

1
def hello():
2
    print("Hi")
3

4
f = hello
5
f()   # 等价于 hello()

内置函数#

isinstance(object, classinfo)：object ：要检查的对象，classinfo ：类型或类型元组（可以同时检查多种类型），返回值：布尔值（ True 或 False ）

面向对象#

类是模版，实例/对象是根据模板生成的具体东西，类定义属性和行为，对象拥有这些属性并能执行这些行为。

类定义写法：

1
class 类名:
2
    # 类体（属性 + 方法）
3
    pass

定义类属性 & 实例属性#

1
class Student:
2
    school = "xxx_university" #类属性
3

4
    def __init__(self,name):
5
        self.name = name #实例属性

类型	属于谁	访问方式
类属性	类本身所有对象共享	`Person.species`
实例属性	每个对象独立持有	`p.name`

动态绑定属性：

1
a.gender = '男'  #动态添加属性
2
print(f"我是{a.gender}生")

实例方法#

定义在类中的函数，称为方法，自带一个参数self

实例方法如示例调用

1
    def show(self,age):
2
        self.age = age
3
        print(f"我今年{se  lf.age}岁了")
4

5
a=Student("xiaomin")
6
a.show(18)

静态方法#

使用@staticmethod修饰的方法，不需要self/cls属性
不能调用实例属性和实例方法

1
class Person:
2
    @staticmethod
3
    def test():
4
        print("静态方法不能使用实例属性和实例方法")

类方法#

cls指向类本身

1
class MyClass:
2
    @classmethod
3
    def my_class_method(cls):
4
        print(cls)

类方法可以通过 类或实例 修改类属性。

动态绑定方法：

1
a.func=introduce #函数的一个赋值
2
#func是a对象的方法
3
a.func() #调用

创建类对象与调用#

传入两个参数因为__init__方法中有两个形参，是自带的属性，无需手动传入
实例属性、类属性通过打点调用
实例方法使用对象名进行打点调用
静态方法、类方法使用类名打点调用

1
stu=Student('nz',18) #创建对象
2
print(stu.name,stu.age) #调用实例属性
3
print(Student.school) #调用类属性
4
stu.show() #调用实例方法
5
Student.sm() #调用静态方法
6
Student.cm() #调用类方法

权限控制#

1
class Student():
2
    def __init__(self,name,age,gender):
3
        self._name = name #self_name受保护，只能本类和子类访问
4
        self.__age = age #self.__age表示私有的，只能类本身去访问
5
        self.gender = gender #普通的实例属性
6

7
    def _func1(self): #受保护的
8
        print("子类及本身可以访问")
9

10
    def __func2(self): #私有的
11
        print("只有定义的类可以访问")
12

13
    def show(self): #普通的实例方法
14
        self._func1() #类本身访问受保护的方法
15
        self.__func2() #类本身访问私有方法
16
        print(self._name) #受保护的实例属性
17
        print(self.__age) #私有实例属性
18

19
stu=Student("xiaomin",18,"男")
20
stu.show()

访问私有成员#

示例：

1
class Person:
2
    def __init__(self):
3
        self.__age = 18
4

5
    def __secret(self):
6
        print("这是秘密")
7

8
    def get_age(self):
9
        return self.__age

方法一：

1
p = Person()
2
print(p.get_age())

方法二（可行但不推荐）——名称重整：

1
__attr  →  _类名__attr

访问私有属性：print(p._Person__age)
访问私有方法：p._Person__secret()

属性的设置#

@property

**只读属性（getter） **

1
class Person:
2
    def __init__(self, age):
3
        self._age = age   # 内部属性
4

5
    @property
6
    def age(self):
7
        return self._age

使用方式：

1
p = Person(18)
2
print(p.age)   # 看起来是属性
3
#实际上执行 Person.age(p)

**可写属性（setter） **

1
class Person:
2
    def __init__(self, age):
3
        self._age = age
4

5
    @property
6
    def age(self):
7
        return self._age
8

9
    @age.setter #属性名.setter
10
    def age(self, value):
11
        if value < 0:
12
            raise ValueError("age must be non-negative")
13
        self._age = value

1
p.age = 20   # 自动调用 setter

继承#

它指的是定义一个新类，而对现有类的进行很少修改或没有修改。新类称为派生（或子）类，而从其继承的新类称为基（或父）类。

1
class Person:
2
    def __init__(self,name,age):
3
        self.name = name
4
        self.age = age
5
    def show(self):
6
        print(f"大家好，这里是{self.name},今年{self.age}")
7

8
#Student继承Person类
9
class Student(Person):
10
    def __init__(self,name,age,id):
11
        super().__init__(name,age) #调用父类的初始化方法
12
        self.id = id
13

14
#Doctor继承Person类
15
class Doctor(Person):
16
    def __init__(self, name, age,department):
17
        super().__init__(name, age)
18
        self.department = department
19

20
#创建一个子类对象
21
stu=Student("xiaomin",18,"1001")
22
stu.show()
23

24
doctor=Doctor("zhangsan",30,"内科")
25
doctor.show()

多继承#

1
class FatherA():
2
    def __init__(self,name):
3
        self.name = name
4

5
    def showA(self):
6
        print(f"我是父类A中的方法")
7

8
class FatherB():
9
    def __init__(self,age):
10
        self.age = age
11

12
    def showB(self):
13
        print(f"我是父类B中的方法")
14

15
#多继承
16
class Son(FatherA,FatherB):
17
    def __init__(self,name,age,id):
18
        FatherA.__init__(self,name) #调用父类A的初始化方法
19
        FatherB.__init__(self,age)  #调用父类B的初始化方法
20
        self.id = id
21

22
son=Son("xiaoming",18,"1001")
23
son.showA()
24
son.showB()

方法重写#

优先调用子类的方法

1
class Person:
2
    def __init__(self,name,age):
3
        self.name = name
4
        self.age = age
5
    def show(self):
6
        print(f"大家好，这里是{self.name},今年{self.age}")
7

8
#Student继承Person类
9
class Student(Person):
10
    def __init__(self,name,age,id):
11
        super().__init__(name,age) #调用父类的初始化方法
12
        self.id = id
13

14
    def show(self):
15
        super().show() #调用父类中的方法
16
        print(f"我的学号是{self.id}")
17

18

19
#Doctor继承Person类
20
class Doctor(Person):
21
    def __init__(self, name, age,department):
22
        super().__init__(name, age)
23
        self.department = department
24

25
#创建一个子类对象
26
stu=Student("xiaomin",18,"1001")
27
stu.show()
28

29
doctor=Doctor("zhangsan",30,"内科")
30
doctor.show()

构造函数#

构造函数（constructor）指的是类中的一个特殊方法，其基本形式为：

1
class Person:
2
    def __init__(self, name, age):
3
        self.name = name    # 给对象绑定属性
4
        self.age = age
5

6
p = Person("Tom", 20)   # 自动调用 __init__

1
class CuteCat:
2
    def __init__(self,cat_name,cat_age,cat_color):
3
        self.name = cat_name
4
        self.age = cat_age
5

6
cat1 = CuteCat("jojo", 12,"orange")
7
print(f"{cat1.name} is {cat1.age} years old")

构造函数的作用：

✔ 设置对象属性

1
self.name = name

✔ 初始化对象状态

1
self.score = 0

无参数构造函数示例：

1
class Test:
2
    def __init__(self):
3
        print("构造函数被调用了")
4

5
t = Test()

文件操作#

上下文管理器#

上下文管理器（Context Manager）就是一个能被 with 语句管理进入和退出过程的对象。

换句话说：

只要一个对象定义了 **__enter__** 和 **__exit__** 方法，它就是上下文管理器。

1
class MyOpen:
2
    def __init__(self,filepath):
3
        print("Entering constructor of MyOpen")
4
        self.filepath = filepath
5

6
    def __enter__(self):
7
        print("Ent  ering __enter__ of MyOpen")
8
        return self.filepath
9

10
    def __exit__(self,exc_type,exc_value,traceback):
11
        print("Entering __exit__ of MyOpen")
12

13
with MyOpen('data.txt') as file:
14
    print(f"the value of file is {file}")

上下文管理器必须实现两个方法#

__enter__(self) ：进入 with 块之前自动执行，返回值会赋给 as 后面的变量
__exit__(self, exc_type, exc_val, exc_tb)：结束 with 块后自动执行，无论是否发生异常都会执行，可以决定要不要让异常继续向外抛出

with的执行流程：

1
1. 调用 obj = 表达式
2
2. 调用 obj.__enter__()
3
3. 执行 with 代码块
4
4. 调用 obj.__exit__()

文件读取#

read：读取文件的全部内容
readline()：逐行读取
readlines()：读取所有行并存储为**列表 **

1
f = open("./data.txt","r",encoding="utf-8")
2
print(f.read()) #内括号可以写入读取的字节数

1
f = open("./data.txt","r",encoding="utf-8")
2
lines = f.readlines()
3
for line in lines:
4
    print(line)
5
f.close() #关闭文件，释放资源

使用with关键字（with 语句最典型的用途是处理那些需要在使用后明确释放或关闭的资源）：with 表达式 as 变量:

文件操作就是一个上下文管理器

1
# data.txt 在 with 块结束后，无论是否发生错误，都会自动调用 file.close() 方法
2
with open('data.txt', 'r', encoding='utf-8') as file:
3
    content = file.read()
4
    print(content)
5

6
# 在 with 块外部，文件已经被安全关闭

文件写入#

写入模式 ‘w’：如果文件存在，内容会被清空
使用’a’参数打开，为附加模式
使用’r+’参数，同时支持读写文件

1
with open('output.txt', 'w', encoding='utf-8') as f:
2
    # 写入第一行，必须手动添加换行符
3
    f.write("这是写入文件的第一行。\n")
4

5
    # 写入第二行
6
    f.write("这是第二行内容。\n")

1
# 'r+': 读写模式，指针在开头
2
with open("./data.txt", "r+", encoding="utf-8") as f:
3
    f.write("hello!\n") # 写入，指针移动到写入内容的末尾
4
    f.write("yoo")     # 继续写入，指针移动到新的末尾
5

6
    # --- 关键步骤 ---
7
    # 将文件指针移回文件开头 (位置 0)
8
    f.seek(0)
9

10
    # 重新读取整个文件的内容
11
    print(f.read())

模式	含义
`"r"`	只读（文本）
`"w"`	只写（文本，覆盖）
`"a"`	追加（文本）
`"rb"`	二进制读
`"wb"`	二进制写
`"ab"`	二进制追加
`"r+"`	读写

打包与解包#

单*#

* (单个星号) 的作用：主要用于**序列（如列表、元组）**的打包和解包。

1
numbers = [1,2,3,4,5]
2
first,*rest = numbers
3
print(first)
4
print(rest)

1
1
2
[2, 3, 4, 5]

打包成一个序列：

1
def print_values(*args):
2
    for arg in args:
3
        print(arg)
4

5
print_values(1,2,3,4)

解包：

1
list1 =[1,2,3]
2
tuple1 = (4,5,6)
3
merged = [*list1,*tuple1]
4
print(merged)
5
# [1, 2, 3, 4, 5, 6]

将person解包成Alice和30两个值：

1
def greet(name,age):
2
    print(f"hello{name},you are {age} years old")
3

4
person =('Alice',30)
5
greet(*person)

双**#

** 主要用于字典 (Dictionary) 的打包和解包。

打包：

1
def example(**kwargs):
2
    for key,value in kwargs.items():
3
        print(f"{key}={value}")
4

5
example(a=1,b=2)

解包（字典）：

1
def create_profile(name,age,email):
2
    print(f"name : {name}")
3
    print(f"age : {age}")
4
    print(f"email : {email}")
5

6
option = {
7
    "name":"tony",
8
    "age":18,
9
    "email":"tony@qq.com"
10
}
11

12
create_profile(**option)

装饰器#

修饰器本质上是一个以函数作为参数，并返回新函数的函数。

**装饰器的基本用法： **

1
def my_decorator(func):
2
    def wrapper():
3
        print("Before function")
4
        func()
5
        print("After function")
6
    return wrapper
7

8
@my_decorator
9
def hello():
10
    print("Hello!")
11

12
hello()

输出：

1
Before function
2
Hello!
3
After function

1
def hello():
2
    ...

设计模式#

工厂模式#

把“创建对象”的代码，集中放到一个地方管理

1
# 工厂模式
2
class DatabaseConnection:
3
    def __init__(self,host,port,username,password):
4
        self.host = host
5
        self.port = port
6
        self.username = username
7
        self.password = password
8

9
    def connect(self):
10
        return f'Connecting to database at {self.host}:{self.port} with username {self.username}'
11

12
def connection_factory(db_type):
13
    db_config ={
14
            'main':{
15
                'host': 'localhost',
16
                'port': 3306,
17
                'username': 'root',
18
                'password':'password123'
19
            },
20
            'analytics':{
21
                'host': '192.168.1.1',
22
                'port': 5432,
23
                'username': 'admin',
24
                'password':'pass3'
25
            },
26
            'cache':{
27
                'host': '10.0.0.1',
28
                'port': 27017,
29
                'username': 'cacher',
30
                'password':'cipher'
31
            }
32
     }
33
    return DatabaseConnection(**db_config[db_type]) #字典解包
34

35

36
def client():
37
    main_db = connection_factory('main')
38
    analytics_db = connection_factory('analytics')
39
    cache_db = connection_factory('cache')
40

41
    print(main_db.connect())
42
    print(analytics_db.connect())
43
    print(cache_db.connect())
44

45
client()

单例模式#

1
class DatabaseConnection:
2
    _instance = None
3

4
    def __new__(cls,*args,**kwargs) -> Self:
5
        if cls._instance is None:
6
            cls._instance = super().__new__(cls)
7
        return cls._instance

模块#

导入自己写的模块#

1
project/
2
  main.py
3
  moduleA.py

moduleA.py内容：

1
def hello():
2
    print("Hello from module!")

main.py 中：

1
import moduleA
2

3
mytool.hello()

在不同目录下，__init__.py的用法：

1
my_package/
2
    __init__.py
3
    util.py

1
import my_package

相当于导入了__init__.py

__init__.py的使用：

包的初始化

管理包接口

相对导入#

相对导入的符号：

写法	含义
`.`	当前包（current package）
`..`	上一级目录（parent package）
`...`	上上级目录（grandparent package）

示例目录结构#

1
project/
2
    packageA/
3
        __init__.py
4
        a.py
5
        b.py
6
        sub/
7
            __init__.py
8
            c.py

1
from . import
2
# from .a import func_a 导入a某个函数

不写点 Python 会从项目根路径开始查找 a 模块，而不是同级目录。

from package.a import func_a = from .a import func_a

非同级目录的导入#

from packageA.a import x的过程

路径 packageA.a 其实是：

1
packageA   ← 一个包（必须先被加载）
2
packageA.a ← 包里的模块

所以导入 packageA.a 的过程是：

1
import packageA
2
import packageA.a

魔术方法#

<font style="color:rgb(15, 17, 21);background-color:rgb(235, 238, 242);">__all__</font> 是一个模块级别的变量（通常是一个字符串列表），用于定义当客户端代码使用 <font style="color:rgb(15, 17, 21);background-color:rgb(235, 238, 242);">from module import *</font> 时，哪些名称会被导入到当前命名空间。

1
__all__ = ['public_func', 'PublicClass']
2

3
def public_func():
4
    return "I'm public!"
5

6
def _private_func():
7
    return "I'm private (by convention)!"
8

9
class PublicClass:
10
    pass
11

12
class _PrivateClass:
13
    pass

json的基础使用和操作#

将json导出至文件

1
import json
2

3
user = {}
4
user['name'] = input('请输入姓名')
5
user['age'] = int(input('请输入年龄'))
6

7
contents = json.dumps(user) #字典 -> 字符串
8

9
with open("user_info.json", "r+", encoding="utf-8") as f:
10
    f.write(contents)

读取json文件

1
import json
2

3
with open("./user_info.json","r",encoding="utf-8") as f:
4
    contents = f.read()
5
    user = json.loads(contents) # 字典 <- 字符串
6

7
    print(f'欢迎回来{user['name']}')

迭代器和生成器#

for循环原理：

1
it =iter(my_str)  #my_str.__iter__()，my_str为可迭代对象，it为迭代器
2
while True:
3
    try:
4
        print(next(it)) #调用迭代器的next方法
5
    except StopIteration: #无法获取迭代器的下一个值
6
        break

示例：

1
class MyNumbers:
2
    def __init__(self):
3
        self.num = 1
4

5
    def __iter__(self):
6
        return self #获取迭代器自己
7

8
    def __next__(self):
9
        if self.num <= 3:
10
            x = self.num
11
            self.num += 1
12
            return x
13
        else:
14
            raise StopIteration

生成器函数#

函数里只要出现 yield，这个函数就变成 生成器函数。

1
def generator(n):
2
    for i in range(n):
3
        print('before yield')
4
        yield i
5
        print("after yield")
6

7
gen = generator(3) #gen 为生成器
8
print(next(gen)) #生成0后退             出生成器函数
9
print("---")
10
for i in gen:
11
    print(i)

1
before yield
2
0
3
---
4
after yield
5
before yield
6
1
7
after yield
8
before yield
9
2
10
after yield

柯里化#

把一个接受多个参数的函数，变成一系列只接受一个参数的函数。

柯里化后函数

1
def add(a):
2
    def inner(b):
3
        return a + b
4
    return inner

执行：

1
add(3)  → 返回 inner 函数，相当于固定a的值
2
inner(5) → 计算 3 + 5

functools.partial(func, 参数1, 参数2, ...)：
示例：

1
from functools import partial
2

3
def multiply(a, b):
4
    return a * b
5

6
double = partial(multiply, 2)
7

8
print(double(5))

多线程#

1
from time import sleep,time
2
from threading import Thread
3

4
start_time = time()
5

6
def dnownload_img(url):
7
    sleep(1)
8
    print(f'{url} download complete')

线程池：

1
from decimal import MAX_EMAX
2
from time import time,sleep
3
from concurrent import futures #线程池
4

5
start_time = time()
6

7
def download_img(url):
8
    sleep(1)
9
    return (f'{url} download complete')
10

11
with futures.ThreadPoolExecutor(max_workers=10) as excutor:
12
    results = excutor.map(download_img,range(10)) #range(10)作为download_img的参数
13
    for result in results:
14
        print(result)

**守护线程（daemon=True）：**主线程一结束，**不管它干没干完，直接结束；**不会影响程序退出

枚举#

定义枚举

1
import enum
2

3
class ChapterResult(enum.Enum):
4
    SUCCESS = 0
5
    ERROR = 1
6
    NOT_OPEN = 2
7
    PENDING = 3

使用枚举

1
result = ChapterResult.SUCCESS

属性

属性	含义	示例
`result.name`	枚举成员名	`"ERROR"`
`result.value`	实际存储的值	`1`

遍历查看member：

1
from enum import Enum
2

3
class Color(Enum):
4
    RED = 1
5
    GREEN = 2
6
    BLUE = 3
7

8
for c in Color:
9
    print(c.name,c.value)

enum和bool一样，也具有唯一实例性和不可变

数据解析#

正则表达式#

元字符	含义
`.`	匹配任意一个字符（除换行）
`\d`	数字 `[0-9]`
`\D`	非数字
`\w`	字母、数字、下划线
`\W`	非字母数字下划线
`\s`	空白字符（空格、`\t` 、`\n` ）
`\S`	非空白字符
`	`
`[...]`	匹配字符组中的字符
`[^...]`	除了字符组里的内容

限定符（控制前面原字符出现的次数）	含义
`*`	0 次或多次
`+`	1 次或多次
`?`	0 次或 1 次
`{n}`	恰好 n 次
`{n, }`	至少 n 次
`{n,m}`	n 到 m 次

^从开头开始匹配，$结尾匹配

贪婪匹配和惰性匹配

1
.*  贪婪匹配（尽可能多地匹配结果）
2
.*? 惰性匹配（尽可能少地匹配结果 -> 回溯）

re模块#

re.match(pattern, string)—— 只在字符串“开头”尝试匹配

示例：

1
import re
2

3
text = "123abc"
4

5
m = re.match(r"\d+", text)
6
print(m.group())   # 123

re.search(pattern, string)—— 在整个字符串中查找第一个匹配结果

示例：

1
text = "abc123def456"
2

3
m = re.search(r"\d+", text)
4
print(m.group())  #从匹配到的结果拿数据
5
# 123

re.findall(pattern, string)—— 返回所有匹配结果的列表

示例（注意有无分组）：

1
re.findall(r"(\d+)", text)
2
# ['123', '456', '789']

1
re.findall(r"(\d+)([a-z]+)", "123abc456def")
2
# [('123', 'abc'), ('456', 'def')]

re.finditer返回的是迭代器
re.sub(pattern, repl, string)

示例：

1
text = "电话：123-456-789"
2

3
new = re.sub(r"\d", "*", text)
4
print(new)
5
# 电话：***-***-***

re.split(pattern, string)—— 按正则规则切字符串

示例：

1
text = "apple,banana;orange|grape"
2

3
parts = re.split(r"[,;|]", text)
4
print(parts)
5
# ['apple', 'banana', 'orange', 'grape']

预加载，提前把正则对象加载完毕：

1
obj = re.compile(r"\d+")
2
for item in x:
3
    obj.finditer(item)

1
obj = re.compile(r"\d+")
2
#直接把加载好的正则使用
3
result = obj.findall(text)

分组：

普通分组

1
(\d+)

自动编号：1、2、3……；用 group(1)、group(2) 取

1
import re
2

3
m = re.search(r"(\d+)-(\w+)", "123-abc")
4
print(m.group(1))  # 123
5
print(m.group(2))  # abc

命名分组

1
(?P<name>\w+)

示例：

1
m = re.search(r"(?P<year>\d{4})-(?P<month>\d{2})", "2026-02")
2
print(m.group("year"))   # 2026
3
print(m.group("month"))  # 02

1
import re
2

3
s="""
4
<div class='西游记'><span id='10010'>中国联通</span></div>
5
<div class='西游记'><span id='10086'>中国移动</span></div>
6
"""
7

8
obj =re.compile(r"<span id='(?P<id>\d+)'>(?P<name>.*?)</span>")
9
result =obj.finditer(s)
10
for item in result:
11
    id = item.group("id")
12
    print(id)
13

14
    name=item.group("name")
15
    print(name)

time模块#

time 提供的是偏底层、面向系统时间的功能。

常用函数：

time.time() —— 当前时间戳
time.localtime([sec]) —— 本地时间结构

1
time.struct_time(
2
    tm_year,   # 年
3
    tm_mon,    # 月 (1-12)
4
    tm_mday,   # 日 (1-31)
5
    tm_hour,   # 时 (0-23)
6
    tm_min,    # 分 (0-59)
7
    tm_sec,    # 秒 (0-61)
8
    tm_wday,   # 星期 (0=周一)
9
    tm_yday,   # 一年中的第几天
10
    tm_isdst   # 是否夏令时
11
)

time.ctime()——返回当前时间戳的字符串表示
time.strftime() —— 时间 → 字符串

1
now = time.localtime()
2
s = time.strftime("%Y-%m-%d %H:%M:%S", now)
3
print(s)

输出

1
2025-12-24 21:30:00

常用格式符：

格式	含义
`%Y`	年
`%m`	月
`%d`	日
`%H`	时
`%M`	分
`%S`	秒

time.strptime() —— 字符串 → 时间 , 返回 struct_time

1
t = time.strptime("2025-12-24 21:30:00", "%Y-%m-%d %H:%M:%S")

loguru库#

安装：pip install loguru

快速入门#

严重程度逐渐增加

级别	描述
TRACE	最详细的日志，调试级别
DEBUG	调试信息
INFO	普通信息
SUCCESS	成功信息
WARNING	警告
ERROR	错误
CRITICAL	严重错误，程序可能无法继续

1
from loguru import logger
2

3
logger.debug("调试信息")
4
logger.info("普通信息")
5
logger.warning("警告信息")
6
logger.error("错误信息")
7
logger.critical("严重错误")

add方法#

1
logger.remove() #移除了 loguru 的所有默认 sink
2
logger.add("file.log", rotation="5 MB", retention="10 days", level="INFO")

参数解释：

rotation → 日志文件达到多少大小或时间就切分
retention → 保留多长时间的日志
level → 最低记录级别

tqdm库#

tqdm 是 Python 中非常流行的 进度条库
安装：**pip install tqdm**

手动更新#

pbar.update(n) 表示在当前进度基础上增加 n 个 item

1
pbar = tqdm(total=100)
2
pbar.update(10)
3
sleep(2)
4
pbar.update(20)
5
sleep(2)
6
pbar.update(70)
7
pbar.close()

使用with语句：

1
with tqdm(total=100) as pbar:
2
    pbar.update(10)
3
    sleep(2)
4
    pbar.update(20)
5
    sleep(2)
6
    pbar.update(70)

write用法#

功能：在进度条上方打印消息，不破坏 tqdm 进度条

1
tqdm.write(msg, file=sys.stdout, end='\n', nolock=False)

参数：

参数	说明
msg	要打印的消息（字符串）
file	输出流，默认 `sys.stdout`
end	行尾符，默认 `\n`
nolock	多线程/多进程环境下是否使用锁，默认 `False` （安全）

argparse 模块#

argparse 是 Python 的解析命令行参数的标准库

命令行解析器#

1
import argparse
2
parser = argparse.ArgumentParser() #创建解析器parser
3

4
parser.add_argument("--name") #添加参数
5

6
args = parser.parse_args() #解析参数
7
print(args.name)

终端执行：

1
python test.py --name Tom

输出：

1
Tom

parser.parse_args() 返回Namespace类，打点访问具体值

参数	作用
name / —name	参数名
help	帮助说明
type	类型
default	默认值
required	是否必填
choices	可选值
action	行为方式

位置参数：

1
import argparse
2

3
parser = argparse.ArgumentParser()
4
parser.add_argument("filename")   # 定义一个“位置参数”
5
args = parser.parse_args()
6

7
print(args.filename)

限定取值：

1
parser.add_argument("--mode", choices=["train", "test"])

**action**参数：

默认行为：store 最常用：store_true store_false

count：统计出现次数

append：可以写多次：

1
python main.py --tag A --tag B --tag C

结果：

1
args.tag == ["A", "B", "C"]

布尔开关参数

1
parser.add_argument("--debug", action="store_true")

1
python main.py --debug

1
args.debug == True

没写就是 False。

sys.argv是命令行参数（脚本名称是 sys.argv[0] ）

configparser 模块#

配置文件解析模块

config.ini

1
[database]
2
host = localhost
3
port = 3306
4
user = root
5
password = 123456
6

7
[server]
8
debug = true
9
timeout = 30

configparser.ConfigParser.items()

基本用法：

1
import configparser
2

3
config = configparser.ConfigParser()
4
config.read("config.ini", encoding="utf-8")
5

6
print(config.sections())           # 所有 section
7
print(config["database"]["host"])  # 取值（字符串）
8
print(config.get("database", "port"))

`urllib.request`模块#

urllib.request 是 python 自带的最基础网络库

1
from urllib.request import urlopen
2

3
url = "http://www.baidu.com"
4

5
resp = urlopen(url)
6

7
with open("msg.txt","wb") as f:
8
    f.write(resp.read())
9

10

11
print(resp.read())

解码：

1
print(resp.read().decode("utf-8"))

`requests`模块#

首个爬虫程序

1
import requests
2

3
url = "http://baidu.com"
4

5
resp = requests.get(url)
6
resp.encoding = "utf-8"
7
print(resp.text)

使用get请求

1
import requests
2

3
content = input('请输入你要检索的内容：')
4
url = f"https://sogou.com/web?query={content}"
5

6

7
headers = {
8
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/145.0.0.0 Safari/537.36"
9
}
10
resp = requests.get(url, headers=headers)
11
resp.encoding = "utf-8"
12
print(resp.text)

很多参数的get请求

1
import requests
2

3
url = "https://movie.douban.com/j/chart/top_list"
4

5
data = {
6
    "type": "13",
7
    "interval_id": "100:90",
8
    "action": "",
9
    "start": "0",
10
    "limit": "20"
11
}
12

13
headers = {
14
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/145.0.0.0 Safari/537.36"
15
}
16

17
resp = requests.get(url,params=data,headers=headers)
18
print(resp.text)

使用post请求

1
import requests
2

3
url = "https://fanyi.baidu.com/sug"
4
data = {
5
    "kw": input("请输入一个单词:")
6

7
}
8

9
resp = requests.post(url, data=data)
10
print(resp.json()["data"]) # 返回字典

session

自动保存 Cookie，自动带上之前的 Cookie，复用连接（更快），统一管理 headers、params、auth 等

1
import requests
2

3
session = requests.Session()
4

5
# 登录
6
session.post("https://xxx.com/login", data={
7
    "user": "a",
8
    "pwd": "123"
9
})
10

11
# 已经是“登录态”
12
resp = session.get("https://xxx.com/profile")
13
print(resp.text)

mount 是 requests.Session 里一个比较高级但非常有用的接口，主要用于：“给不同协议或不同域名，绑定不同的“连接适配器（HTTPAdapter）”

常见用法：

给 http / https 设置重试策略

1
from requests import Session
2
from requests.adapters import HTTPAdapter
3

4
s = Session()
5
s.mount("http://", HTTPAdapter(max_retries=3))
6
s.mount("https://", HTTPAdapter(max_retries=3))
7

8
s.get("https://example.com")

只对某个域名生效

1
s.mount("https://api.example.com", HTTPAdapter(max_retries=10))

控制连接池大小（进阶）

1
adapter = HTTPAdapter(
2
    pool_connections=10,
3
    pool_maxsize=20,
4
    max_retries=5
5
)
6

7
s.mount("https://", adapter)

参数含义：

pool_connections：连接池数量
pool_maxsize：每个池最多多少连接
max_retries：重试次数

proxies代理

1
import requests
2

3
url="http://baidu.com"
4

5
#代理
6
proxy={
7
    "http": "xxx",
8
    "https": "xxx"
9
}
10
resp = requests.get(url,proxies=proxy)
11
resp.encoding="utf-8"
12
print(resp.text)

timeout

timeout 用来限制：请求最多等多久；如果超过这个时间还没成功，就直接抛异常，不再傻等。

bs4模块#

1
pip install lxml

基本使用方式

1
import requests
2
from bs4 import BeautifulSoup
3

4
url = "https://example.com"
5

6
html = requests.get(url).text
7
soup = BeautifulSoup(html, "lxml")

获取标签、获取文本

1
soup.h1 #soup.find("h1")

1
soup.h1.text #soup.h1.string

find和find_all（找所有）

1
soup.find("p", class_="text")

方法	功能	返回值	找不到时
`soup.find(name, attrs, ...)`	查找第一个匹配的标签	单个 `Tag` 对象	`None`
`soup.find_all(name, attrs, ...)`	查找所有匹配的标签	包含 `Tag` 对象的`list`	空列表 `[]`

进阶：使用CSS选择器

soup.select('selector')：返回所有匹配元素的列表。
soup.select_one('selector')：返回第一个匹配的单个元素

文本提取：

方法	说明	示例
`tag.get_text()`	获取所有内部文本（包括子孙），可传分隔符	`tag.get_text(strip=True)`
`.string`	仅当标签内部只有一个文本节点时有效，否则为 `None`	`<p>Hello</p>` 可以得到 `'Hello'`
`.strings`	生成器，返回所有子孙文本节点	适合处理混杂标签与文本
`.stripped_strings`	同 `.strings` ，但自动去除空白	最常用于抽取纯文本列表

南栀 Space