Metax GPU + PaddleOCR-VL-1.5 + FastDeploy 编译打卡记录

百度黑客松打卡任务:

在 MetaX GPU 环境下完成 FastDeploy 编译打卡的完整流程,目标是一次性跑通:依赖安装、源码编译、wheel 安装、结果截图和邮件提交。


环境信息


执行步骤

1. 安装 Paddle 与 MetaX 相关依赖

pip install paddlepaddle==3.4.0.dev20251223 -i https://www.paddlepaddle.org.cn/packages/nightly/cpu/
pip install paddle-metax-gpu==3.3.0.dev20251224 -i https://www.paddlepaddle.org.cn/packages/nightly/maca/
python -m pip install -U "paddleocr[doc-parser]"
pip install opencv-contrib-python-headless==4.10.0.84

依赖速览

  • paddlepaddle:飞桨框架本体,负责模型计算与执行。
  • paddle-metax-gpu:MetaX GPU 后端适配,让模型真正跑在 MetaX 显卡上。
  • paddleocr[doc-parser]:OCR 与文档解析能力包,用于文档理解任务。
  • opencv-contrib-python-headless:服务器端图像处理库(无 GUI),用于预处理和后处理。

2. 获取 FastDeploy 源码并切换分支

git clone https://gitee.com/paddlepaddle/FastDeploy.git
cd FastDeploy
git checkout release/2.5

3. 配置编译环境变量

建议把下面内容保存为 env_metax.sh,然后执行 source env_metax.sh

#!/bin/sh
export MACA_PATH=/opt/maca

if [ ! -d ${HOME}/cu-bridge ]; then
	${MACA_PATH}/tools/cu-bridge/tools/pre_make
fi

export CUCC_PATH=/opt/maca/tools/cu-bridge
export CUCC_CMAKE_ENTRY=2
export CUDA_PATH=${HOME}/cu-bridge/CUDA_DIR
export PATH=${CUDA_PATH}/bin:${MACA_PATH}/mxgpu_llvm/bin:${MACA_PATH}/bin:${CUCC_PATH}/tools:${CUCC_PATH}/bin:${PATH}
export LD_LIBRARY_PATH=${CUDA_PATH}/lib64:${MACA_PATH}/lib:${MACA_PATH}/mxgpu_llvm/lib:$LD_LIBRARY_PATH
export MACA_VISIBLE_DEVICES="0"
export PADDLE_XCCL_BACKEND=metax_gpu
export FLAGS_weight_only_linear_arch=80
export FD_MOE_BACKEND=cutlass
export ENABLE_V1_KVCACHE_SCHEDULER=1
export FD_ENC_DEC_BLOCK_NUM=2
export FD_SAMPLING_CLASS=rejection

4. 执行编译

source env_metax.sh
bash build.sh

预期产物在 ~/fastdeploy/dist(或你的 build 脚本指定目录)。

5. 安装编译出的 wheel 包

cd ~/fastdeploy/dist
pip install *.whl

命令总览

下面这个代码块汇总了本次打卡会用到的主要命令,可以直接复制执行。

# 1) 安装依赖
pip install paddlepaddle==3.4.0.dev20251223 -i https://www.paddlepaddle.org.cn/packages/nightly/cpu/
pip install paddle-metax-gpu==3.3.0.dev20251224 -i https://www.paddlepaddle.org.cn/packages/nightly/maca/
python -m pip install -U "paddleocr[doc-parser]"
pip install opencv-contrib-python-headless==4.10.0.84

# 2) 下载源码并切换分支
git clone https://gitee.com/paddlepaddle/FastDeploy.git
cd FastDeploy
git checkout release/2.5

# 3) 创建环境脚本(示例名:env_metax.sh)
cat > env_metax.sh << 'EOF'
#!/bin/sh
export MACA_PATH=/opt/maca

if [ ! -d ${HOME}/cu-bridge ]; then
	${MACA_PATH}/tools/cu-bridge/tools/pre_make
fi

export CUCC_PATH=/opt/maca/tools/cu-bridge
export CUCC_CMAKE_ENTRY=2
export CUDA_PATH=${HOME}/cu-bridge/CUDA_DIR
export PATH=${CUDA_PATH}/bin:${MACA_PATH}/mxgpu_llvm/bin:${MACA_PATH}/bin:${CUCC_PATH}/tools:${CUCC_PATH}/bin:${PATH}
export LD_LIBRARY_PATH=${CUDA_PATH}/lib64:${MACA_PATH}/lib:${MACA_PATH}/mxgpu_llvm/lib:$LD_LIBRARY_PATH
export MACA_VISIBLE_DEVICES="0"
export PADDLE_XCCL_BACKEND=metax_gpu
export FLAGS_weight_only_linear_arch=80
export FD_MOE_BACKEND=cutlass
export ENABLE_V1_KVCACHE_SCHEDULER=1
export FD_ENC_DEC_BLOCK_NUM=2
export FD_SAMPLING_CLASS=rejection
EOF

# 4) 执行编译
source env_metax.sh
bash build.sh

# 5) 安装编译产物
cd ~/fastdeploy/dist
pip install *.whl