• zhaoxiaolong2020

    @刘看山 git clone https://github.com/jinfagang/DCNv2_latest.git 这个就是群主的fork版本的,我还是测试python testcuda.py报错,你有配置成功么

    发布在 社区求助区(SOS!!) 阅读更多
  • zhaoxiaolong2020

    补充一下, 我之后又切换到pytorch1.7.0个1.7.1版本,进行尝试,

    git clone https://github.com/jinfagang/DCNv2_latest.git
    cd DCNv2_latest
    python setup.py build develop
    

    首先是能编译成功的,但是进行测试

    python testcuda.py
    

    就报错如下:

    NVIDIA GeForce RTX 3070 Laptop GPU with CUDA capability sm_86 is not compatible with the current PyTorch installation.
    The current PyTorch install supports CUDA capabilities sm_37 sm_50 sm_60 sm_70 sm_75.
    

    说RTX3070在当前pytorch1.7版本下不能用,所以只能想办法在pytorch1.8.1版本上解决了,求大神指点

    发布在 社区求助区(SOS!!) 阅读更多
  • zhaoxiaolong2020

    官方smoke 开源项目的环境 requirements
    Ubuntu 16.04
    Python 3.7
    Pytorch 1.3.1
    CUDA 10.0

    但是我的电脑是RTX3070的显卡,默认支持cuda11.1及以上版本,所以我就各种尝试配置环境如下:

    (base) zxl@R9000P:~/mywork/MANA-AI/DCNv2_latest$ nvcc -V
    nvcc: NVIDIA (R) Cuda compiler driver
    Copyright (c) 2005-2020 NVIDIA Corporation
    Built on Mon_Oct_12_20:09:46_PDT_2020
    Cuda compilation tools, release 11.1, V11.1.105
    Build cuda_11.1.TC455_06.29190527_0
    

    ubuntu18.04
    python 3.8
    pytorch1.8.1和1.9.0
    CUDA11.1

    在pytorch1.8.1和1.9.0环境下执行指令:

    python setup.py build develop
    

    后报错如下

    /home/zxl/mywork/MANA-AI/smoke_mono_3d/smoke/csrc/cuda/dcn_v2_cuda.cu(127): 
    error: identifier "THCudaBlas_SgemmBatched" is undefined
    /home/zxl/mywork/MANA-AI/smoke_mono_3d/smoke/csrc/cuda/dcn_v2_cuda.cu(275): error: identifier "THCudaBlas_Sgemm" is undefined
    /home/zxl/mywork/MANA-AI/smoke_mono_3d/smoke/csrc/cuda/dcn_v2_cuda.cu(329): error: identifier "THCudaBlas_Sgemv" is undefined
    3 errors detected in the compilation of "/home/zxl/mywork/MANA-AI/smoke_mono_3d/smoke/csrc/cuda/dcn_v2_cuda.cu".
    error: command '/usr/local/cuda-11.1/bin/nvcc' failed with exit status 1
    

    网上各种百度找到了下面的方法

    git clone https://github.com/jinfagang/DCNv2_latest.git
    cd DCNv2_latest
    python setup.py build develop
    

    cuda 11.1 +pytorch1.8.1或1.9.0能build成功,但是执行测试程序

    python testcuda.py
    

    报错如下

    raise RuntimeError(msg)
    RuntimeError: Jacobian mismatch for output 0 with respect to input 0,
    numerical:tensor([[ 0.4043,  0.0048, -0.0100,  ...,  0.0000,  0.0000,  0.0000],
           [ 0.1935,  0.0695,  0.0132,  ...,  0.0000,  0.0000,  0.0000],
           [-0.0009,  0.0000,  0.3827,  ...,  0.0000,  0.0000,  0.0000],
           ...,
           [ 0.0000,  0.0000,  0.0000,  ...,  0.0000, -0.0237,  0.0000],
           [ 0.0000,  0.0000,  0.0000,  ..., -0.4143, -0.8342,  0.0000],
           [ 0.0000,  0.0000,  0.0000,  ..., -0.2155, -0.1278, -0.1084]],
          device='cuda:0')
    analytical:tensor([[ 0.4043,  0.0049, -0.0100,  ...,  0.0000,  0.0000,  0.0000],
           [ 0.1934,  0.0695,  0.0133,  ...,  0.0000,  0.0000,  0.0000],
           [-0.0011,  0.0000,  0.3829,  ...,  0.0000,  0.0000,  0.0000],
           ...,
           [ 0.0000,  0.0000,  0.0000,  ...,  0.0000, -0.0237,  0.0000],
           [ 0.0000,  0.0000,  0.0000,  ..., -0.4146, -0.8340,  0.0000],
           [ 0.0000,  0.0000,  0.0000,  ..., -0.2157, -0.1280, -0.1084]],
          device='cuda:0')
    

    最让我百思不得其解的是,三个环境下运行下面的指令都成功

    (py38torch171-smoke) zxl@R9000P:~/mywork/MANA-AI/DCNv2_latest$ python
    Python 3.8.10 (default, Jun  4 2021, 15:09:15) 
    [GCC 7.5.0] :: Anaconda, Inc. on linux
    Type "help", "copyright", "credits" or "license" for more information.
    >>> import torch
    >>> torch.__version__
    '1.7.1'
    >>> torch.cuda.is_available()
    True
    >>>
    
    (py38torch181-smoke) zxl@R9000P:~/mywork/MANA-AI/DCNv2_la00m$ python
    Python 3.8.10 (default, Jun  4 2021, 15:09:15) 
    [GCC 7.5.0] :: Anaconda, Inc. on linux
    Type "help", "copyright", "credits" or "license" for more information.
    >>> import torch
    >>> torch.__version__
    '1.8.1+cu111'
    >>> torch.cuda.is_available()
    True
    >>>
    
    (py38torch190-smoke) zxl@R9000P:~/mywork/MANA-AI/DCNv2_latest$ python
    Python 3.8.10 (default, Jun  4 2021, 15:09:15) 
    [GCC 7.5.0] :: Anaconda, Inc. on linux
    Type "help", "copyright", "credits" or "license" for more information.
    >>> import torch
    >>> torch.__version__
    '1.9.0+cu111'
    >>> torch.cuda.is_available()
    True
    >>>
    

    研究了一天了没成功,求大佬指点,有没有在RTX30系列显卡cuda11.1上配置好环境的,万分感激!

    发布在 社区求助区(SOS!!) 阅读更多

与 神力AI社区 的连接断开,我们正在尝试重连,请耐心等待