用PyTorch复现PINN经典案例：手把手教你用神经网络求解Burgers方程

张开发

• 2026/6/14 3:46:39 • 15 分钟阅读

分享文章

用PyTorch复现PINN经典案例手把手教你用神经网络求解Burgers方程在科学计算领域偏微分方程(PDE)的求解一直是极具挑战性的任务。传统数值方法如有限差分法、有限元法虽然成熟但面临网格生成复杂、计算成本高等问题。近年来物理信息神经网络(PINN)的兴起为这一领域带来了全新思路——它巧妙地将物理定律编码为神经网络的损失函数实现了无网格、数据驱动的PDE求解方案。本文将聚焦流体力学中的经典Burgers方程带你从零实现一个完整的PyTorch版PINN求解器。1. 环境准备与问题定义首先确保已安装Python 3.8和最新版PyTorch。推荐使用conda创建虚拟环境conda create -n pinn python3.8 conda activate pinn pip install torch torchvision matplotlib numpyBurgers方程是描述一维粘性流体运动的经典PDE其形式为$$ u_t uu_x \nu u_{xx}, \quad x \in [-1,1], t \in [0,1] $$其中$\nu0.01/\pi$为粘性系数。初始条件为$u(0,x) -\sin(\pi x)$边界条件为$u(t,-1)u(t,1)0$。这个非线性方程会产生激波现象是测试数值方法的理想基准。2. 神经网络架构设计我们采用全连接网络(MLP)作为近似器其核心结构如下import torch import torch.nn as nn class BurgersPINN(nn.Module): def __init__(self, layers[2,20,20,20,20,1]): super().__init__() self.activation nn.Tanh() self.layers nn.ModuleList() for i in range(len(layers)-1): self.layers.append(nn.Linear(layers[i], layers[i1])) def forward(self, t, x): X torch.cat([t,x], dim1) for i, layer in enumerate(self.layers[:-1]): X self.activation(layer(X)) return self.layers[-1](X)这里有几个关键设计选择输入层接受时空坐标(t,x)的二维输入隐藏层4层20神经元的密集连接层激活函数Tanh函数适合光滑解且二阶导数易计算输出层单神经元输出解u(t,x)的预测值3. 损失函数构建PINN的核心创新在于将物理约束融入损失函数。我们需要三类约束def compute_loss(model, t_data, x_data, u_data, t_phys, x_phys): # 初始条件损失 u_pred model(t_data, x_data) mse_data torch.mean((u_pred - u_data)**2) # 物理残差损失 t_phys.requires_grad_(True) x_phys.requires_grad_(True) u model(t_phys, x_phys) u_t torch.autograd.grad(u, t_phys, grad_outputstorch.ones_like(u), create_graphTrue)[0] u_x torch.autograd.grad(u, x_phys, grad_outputstorch.ones_like(u), create_graphTrue)[0] u_xx torch.autograd.grad(u_x, x_phys, grad_outputstorch.ones_like(u_x), create_graphTrue)[0] f u_t u*u_x - (0.01/torch.pi)*u_xx mse_phys torch.mean(f**2) # 边界条件损失 t_bound torch.linspace(0,1,100).view(-1,1) x_left -1*torch.ones_like(t_bound) x_right 1*torch.ones_like(t_bound) u_left model(t_bound, x_left) u_right model(t_bound, x_right) mse_bc torch.mean(u_left**2) torch.mean(u_right**2) return mse_data mse_phys mse_bc损失项说明损失类型计算方式物理意义初始条件MSE(u_pred, u_true)保证t0时满足初始条件物理残差MSE(f)强制满足控制方程边界条件MSE(u_boundary)确保边界约束4. 训练策略优化高效的训练需要精心设计采样策略和优化配置def train(): # 数据生成 N_data 100 # 初始数据点 N_phys 10000 # 物理残差点 t_data torch.zeros(N_data,1) x_data torch.linspace(-1,1,N_data).view(-1,1) u_data -torch.sin(torch.pi*x_data) # Latin Hypercube采样 t_phys torch.rand(N_phys,1) x_phys 2*torch.rand(N_phys,1)-1 model BurgersPINN() optimizer torch.optim.Adam(model.parameters(), lr1e-3) for epoch in range(20000): optimizer.zero_grad() loss compute_loss(model, t_data, x_data, u_data, t_phys, x_phys) loss.backward() optimizer.step() if epoch % 1000 0: print(fEpoch {epoch}: Loss {loss.item():.4e})关键训练技巧学习率调度初期使用较高学习率(1e-3)后期降至1e-4批量训练对物理残差点采用mini-batch训练自适应权重动态调整各项损失的相对权重L-BFGS微调Adam训练后可用二阶优化器进一步优化5. 结果可视化与分析训练完成后我们可以可视化预测结果import matplotlib.pyplot as plt from mpl_toolkits.mplot3d import Axes3D def plot_solution(model): t torch.linspace(0,1,100) x torch.linspace(-1,1,100) T, X torch.meshgrid(t,x) with torch.no_grad(): U model(T.reshape(-1,1), X.reshape(-1,1)) U U.reshape(100,100).numpy() fig plt.figure(figsize(12,6)) ax fig.add_subplot(111, projection3d) ax.plot_surface(T.numpy(), X.numpy(), U, cmapviridis) ax.set_xlabel(Time) ax.set_ylabel(Position) ax.set_zlabel(Velocity) plt.show()典型训练结果会显示初始时刻的正弦波形随时间演化的激波形成过程最终耗散导致的波形衰减与解析解对比的误差通常在1%以内证明了PINN的有效性。相比传统方法PINN具有以下优势无网格特性无需复杂的空间离散并行计算友好GPU可加速训练过程连续解表示可直接求任意点的解值多物理场耦合易于扩展复杂方程组6. 高级技巧与问题排查实际应用中可能会遇到以下挑战及解决方案梯度爆炸问题现象训练初期损失急剧增大对策使用梯度裁剪调整激活函数(如改用Swish)降低初始学习率局部极小值陷阱现象损失停滞在较高值对策增加物理残差点密度尝试不同的网络初始化引入残差连接计算效率优化# 使用torch.jit.script加速自动微分 torch.jit.script def compute_residual(u, t, x): u_t torch.autograd.grad(u, t, create_graphTrue)[0] u_x torch.autograd.grad(u, x, create_graphTrue)[0] u_xx torch.autograd.grad(u_x, x, create_graphTrue)[0] return u_t u*u_x - (0.01/torch.pi)*u_xx其他实用技巧对时空域进行分区训练采用自适应采样策略结合传统数值方法提供初始猜测使用集成学习降低方差7. 扩展应用与前沿方向成功求解Burgers方程后该框架可扩展到更复杂场景多尺度问题通过层次化网络结构捕捉不同尺度特征局部加密采样策略逆问题求解# 假设ν未知将其设为可训练参数 nu nn.Parameter(torch.rand(1)) def residual(u, t, x): return u_t u*u_x - nu*u_xx高维PDE使用傅里叶特征网络提升高频表征能力结合注意力机制处理长程依赖当前PINN研究的前沿方向包括不确定性量化几何自适应求解与符号计算的结合硬件专用架构设计在工业应用中PINN已成功用于湍流建模生物流体模拟地下资源预测超材料设计