use different stream when create handle #2526

irexyc · 2023-10-31T12:20:28Z

Motivation

Modification

use different cuda stream when create handle.

RunningLeon

LGTM

* docs(project): deploee introduction (open-mmlab#2120) * docs(project): deploee introduction * Update README_zh-CN.md * fix mmseg output sync (open-mmlab#2122) * Update docs (open-mmlab#2114) * update docs from 1.x to main * fix dead links * fix tag_name * Revert "fix tag_name" This reverts commit fcf0c58. * fix readthedocs for zh_cn * support mscv statically linked CRT (open-mmlab#2118) * support mscv mt * prevent modification of flag * fix uploading assets (open-mmlab#2130) * Fit for MMPretrain EfficientFormer (open-mmlab#2108) * add ef in rgtest metafile * fix for EfficientFormerClsHead to LinearClsHead * fix lint * update benchmark * fix lint * fix docs * fix docs * add openvino test * fix export info for mmyolo (open-mmlab#2146) * fix * fix * fix * Update build_ubuntu_x64_ncnn.py (open-mmlab#2154) * update document for docker installation (open-mmlab#2142) * update * update * add docker ci * update * fix * [ADD] pplcv area interpolation (open-mmlab#2153) * ADD: area interpolation for PPL.CV (cuda) backend * ADD: "area" entry for unit testing * [Fix] fix satrn onnxruntime batch inference (open-mmlab#2139) * fix satrn onnxruntime batch inference * add ut * better compare * Update readme intro image and docs (open-mmlab#2175) * update logo * update * update * update * fix ci * Revert "update logo" This reverts commit 6935ff0. * update intro * fix * Add Sdk Doxygen document. (open-mmlab#2157) * doxygen c api * fix ci * update en doc * update zh doc * Add coco-wholebody-hand skeleton (open-mmlab#2186) * update * update * Fix NMS Return Index (open-mmlab#2196) * fix nms index return * fix * fix mmseg out_channels=1 (open-mmlab#2179) * fix * fix * fix * fix * fix * fix android library size (open-mmlab#2095) * fix trt>=8.6 api usage error (open-mmlab#2211) * add sdk profiler docs (open-mmlab#2189) * add sdk profiler docs * fix typos * [Feature] Enable read zip model in prebuild package. (open-mmlab#2185) * support libzip * support windows * fix * update rpath * default build sdk zip model * Fix torch2onnx for pointpillars with multi-level outputs (open-mmlab#2210) * temp fix * fix * update * [Fix] Fix text recog task postprocess (open-mmlab#2209) * update ocr_recog postprocess export * update ocr_recog sdk postprocess * fix read unknown_token * Fix some ut (open-mmlab#2187) * fix vit ut * fix * fix * update logo * Revert "update logo" This reverts commit 2e4fa24. * fix dcn ut * fix * update * fix * Revert "fix" This reverts commit c902cf8. * fix * fix * use torch from docker image * Revert "use torch from docker image" This reverts commit fdb4840. * fix * update * update * update * update * Support deploy of YoloX-Pose (open-mmlab#2184) * dev_mmpose * tide * fix lint * del redundant task and model * fix * test ut * test ut * upload configs * fix * remove debug * fix lint * use mmcv.ops.nms * fix lint * remove loop * debug * test modified ut * fix lint * fix return type * fix * fix rescale * fix * fix pack_result * update batch inference * fix nms and pytorch show_box * fix lint * modify ut * add docstring * modify nms * fix * add openvino config * update docs * fix test_mmpose --------- Co-authored-by: RunningLeon <[email protected]> * Fix reg test for maskrcnn (open-mmlab#2230) * Fix yolox-pose ut (open-mmlab#2231) * update yolox-pose ut * fix lint * fix * bump version to v1.2.0 (open-mmlab#2223) * add is_resize_mask cfg in inst seg (open-mmlab#2229) * add is_resize_mask cfg in inst seg * Update csrc/mmdeploy/codebase/mmdet/instance_segmentation.cpp set is_resize_mask default fasle Co-authored-by: Chen Xin <[email protected]> * Update mmdeploy/codebase/mmdet/deploy/object_detection.py set is_resize_mask default fasle Co-authored-by: Chen Xin <[email protected]> * fix format --------- Co-authored-by: Chen Xin <[email protected]> * fix mmdeploy.version_info (open-mmlab#2215) * fix mmdeploy.version_info * fix mmdeploy.version_info * fix lint --------- Co-authored-by: RunningLeon <[email protected]> * fix yolox-pose link (open-mmlab#2248) * update rknn-toolkit2 version to 1.5 (open-mmlab#2278) * update rknn-toolkit2 version to 1.5 * fix lint * cmake .. * fix int type * fix * Update model-support-list (open-mmlab#2259) * Support onnxruntime fp16 (open-mmlab#2269) * support ort-fp16 * update configs * update * update reg ci * fix mmrotate mmdet3d ort fp16 * fix dead links * Fix rotated nms onnx forward indices err (open-mmlab#2305) * fix pytorch deepcopy trace error (open-mmlab#2319) * feat(demo): add time profile (open-mmlab#2307) * Fix CPP demo code in get_started docs (open-mmlab#2321) * Translate onnxruntime.md and tensorrt.md (open-mmlab#2320) * add kwargs to _scaled_dot_product_attention__tensorrt (open-mmlab#2332) * [Fix] Fix random color for Palette (open-mmlab#2356) * fix(demo): add warmup for wrong profiling result (open-mmlab#2339) * fix(demo): add warmup for wrong profiling result * style(demo/csrc): format cpp code * fix(demo): compile error * Update build_ubuntu_x64_ncnn.py * Update build_ubuntu_x64_ncnn.py * [Docs] Fix mmseg document (open-mmlab#2363) * Update mmseg.md * Update mmseg docs * remove * [MMSIG-MMDeploy-557]Support the deployment of 3D object detection algorithm SMOKE on Jetson (open-mmlab#2312) * [Doc]: Update QR Code (open-mmlab#2371) * [Feature]: Support deployment of panoptic segmentation models (open-mmlab#2347) * add semantic segmentation head Author: Daigo Hirooka <[email protected]> Date: Sun Jun 26 18:25:29 2022 +0900 * add panoptic detection model * add panoptic segmentation configs * support panoptic-fpn * remove interpolate * update * support panoptic-fpn mask2former maskformer * update * support dynamic * update * remove unused rewritings for mask2former * Revert "remove unused rewritings for mask2former" This reverts commit 2b6d24a. * update configs and regs * debug dynamic * fix for panoptic-fpn * update * remove rewritings for mask2former * update reg test config * fix * update docs * fix comments * fix --------- Co-authored-by: Daigo Hirooka <[email protected]> * fix urls in CI and readthedocs (open-mmlab#2364) * debug * fix ut * fix ut for end2end model * fix * update readthedocs * fix * fix skip * enlarge space * fix mmdet export for coreml backend (open-mmlab#2392) * fix mmdet export * fix lint * [Fix]: fix rtmdet-inst sdk (open-mmlab#2343) * support rtmdet-inst sdk * fix batch infer * fix * fix mask resize * fix * update * fix segment fault * fix * fix lint * fix * fix * fix * resolve comments * support deployment of mobilenetv3 (open-mmlab#2256) * Mobilenet V3 to ONNX support * Update supported_models.md * Update supported_models.md * Update mmpretrain.md * Update mmpretrain.md * Update mmpretrain.yml * Update classification.py --------- Co-authored-by: RunningLeon <[email protected]> * docs(build): add more example (open-mmlab#2378) * docs(build): add more example * Update linux-x86_64.md * Update linux-x86_64.md * docs(format): update install doc * Update linux-x86_64.md * Update linux-x86_64.md * Update lint.yml * fix lint --------- Co-authored-by: RunningLeon <[email protected]> * refine gemm in ort mdcn (open-mmlab#2292) * refine gemm in ort mdcn * int64_t -> int32_t * Fix ONNX Runtime Op memory leak (open-mmlab#2406) * Fix sdk error for multi-gpu execution (open-mmlab#2411) * Rename MMEdit to MMagic (open-mmlab#2418) * Support dino from mmdet (open-mmlab#2410) * detr batch infer * support dino * remove dynamic batch * update doc * disable exporting masks for image paddings in multi-batch inference * fix * remove rewriting and move changes to mmdet * fix img load for super-resolution in sdk (open-mmlab#2426) * fix load for super-resolution in sdk * fix lint * fix building deformconv for onnxrubtime on windows(open-mmlab#2432) * fix roi align symbolic function in onnx opset>=16 (open-mmlab#2428) * fix torch2onnx for mmdet3d (open-mmlab#2425) * fix mmdet3d * fix * resolve comments * fix rtmdet ncnn * update docs * Upgrade CI and prebuild to cu118 (open-mmlab#2447) * update * fix * update docs * fix * remove ort-fp16 in reg test * fix build * debug * debug * Revert "debug" This reverts commit 2505464. * Revert "debug" This reverts commit 7779f0b. * update * load ort lib for prebuilt mmdeploy * support coco-wholebody visualization in pose_tracker python demo (open-mmlab#2450) * update * update * Fix conversion with rtmdet-inst, vit, conformer (open-mmlab#2453) * fix * fix scaled_dot_product_attention * Bump version v1.3.0 (open-mmlab#2451) * update * bump version * export get_simcc_maximum for simcc (open-mmlab#2449) * update * update for simcc csrc * fix docker ci * update simcc_label * CodeCamp2023-670 (open-mmlab#2476) * Update the document of building mmdeploy with ncnn backend on Windows platform * fix a bug * fix lint error * fix lint error * CodeCamp2023-671 (open-mmlab#2422) * add nms ops * add some file * new file * some change * Update nms_match.cpp * Update nms_match.cpp * Update __init__.py * Delete test_onnx_match.onnx * Delete tests/test_ops/test_onnx_match.onnx * Update test_nms_match_small.py * Update test_nms_match_small.py * Update nms_match.cpp remove allocate * Update nms_match.py remove some test print * Update test_nms_match_small.py * Update nms_match.cpp * Update nms_match.py * Update test_nms_match_small.py * fix the lint fix the lint * Update test_nms_match_small.py * Update test_nms_match_small.py * Update nms_match.cpp * Update test_nms_match_small.py * Update test_nms_match_small.py add input_names * Update onnxruntime.md * Update onnxruntime.md * Update test_nms_match_small.py * Update onnxruntime.md * Update onnxruntime.md * Update test_nms_match_small.py Add UT in nmsmatch * Update test_nms_match_small.py * Update test_nms_match_small.py * CodeCamp2023-555 (open-mmlab#2469) * support condinst from mmdet * remove * update * update * support batch inference * add condinst head unit testing * fix lint error * remove * fix bug in postprocess * remove * update --------- Co-authored-by: RunningLeon <[email protected]> * add condinst ut & update docs (open-mmlab#2481) * support htc (open-mmlab#2438) * support htc * update mmdet.yml * [Fix] fix the onnx exportation for yoloxpose in mmpose (open-mmlab#2466) * fix the onnx exportation for yoloxpose * remove deprecated func * refine code * fix the rescaling process of top-down models * fix ut * add yoloxpose in regression test * fix comment * rebase & fix conflict * Roman duris/fix jetson wheel platform (open-mmlab#2497) * generate_build_config.py now selects <any> wheel platform for system flag <jetson> * Revert "generate_build_config.py now selects <any> wheel platform for system flag <jetson>" This reverts commit d722bee. * generate_build_config.py now selects <any> wheel platform for system flag <jetson> * use different stream when create handle (open-mmlab#2526) * fix ort wrapper (open-mmlab#2560) * Fix mask dimensions mismatch in rtmdet-ins models after exporting to onnx (open-mmlab#2574) * Added the ability to build a project with PyTorch 2.0. (open-mmlab#2553) * Added the ability to build a project with PyTorch 2.0. Namely, I added the flag -std=c++17 to extra_compile_args depending on the version of Torch. * Lost the condition for the presence of nvcc * Lost the condition for the presence of nvcc * Add parse_version * fix lint --------- Co-authored-by: Xin Chen <[email protected]> * There was an error when using a model from the mmyolo repository during image preprocessing, provided that an array was supplied as input. If you input the path to the image, everything works fine. (open-mmlab#2554) * [Feature] Support ONNX and TensorRT exportation of RTMO models (open-mmlab#2597) * support ONNX&TensorRT exportation of RTMO * add configs for rtmo * replace bbox expansion factor with parameter bbox_padding * refine code * refine comment * apply model.switch_to_deploy in BaseTask.build_pytorch_model * fix lint * add rtmo into regression test * add rtmo with trt backend into regression test * add rtmo into supported model list * Fix "has no attribute startswith" (open-mmlab#2498) The value needs to be converted to string first. * Fix error when default_scope is None (open-mmlab#2499) * Fix inference on list of ndarrays (open-mmlab#2506) * bugfix (open-mmlab#2581) * fix ci (open-mmlab#2609) * fix ci * change to ort fp32 * bump version to v1.13.1 (open-mmlab#2613) * add new dockerfile, add run cmd * change Dockerfile for building mmdeploy image * ls fixes to use new mmpose image * add branch for building image * fix * remove 1.3.2-branch * add mmpose to mmdeploy repo * remove mmpose branch checkout --------- Co-authored-by: tpoisonooo <[email protected]> Co-authored-by: Chen Xin <[email protected]> Co-authored-by: RunningLeon <[email protected]> Co-authored-by: huayuan4396 <[email protected]> Co-authored-by: Carmine Ianni <[email protected]> Co-authored-by: huangjiyi <[email protected]> Co-authored-by: Chen Xin <[email protected]> Co-authored-by: ruisv <[email protected]> Co-authored-by: xiabo123 <[email protected]> Co-authored-by: Tau <[email protected]> Co-authored-by: AllentDan <[email protected]> Co-authored-by: YH <[email protected]> Co-authored-by: fanyunchao <[email protected]> Co-authored-by: blue-q <[email protected]> Co-authored-by: Tianlong Ai <[email protected]> Co-authored-by: zhjunqin <[email protected]> Co-authored-by: zhengjie.xu <[email protected]> Co-authored-by: Daigo Hirooka <[email protected]> Co-authored-by: mpocwierz <[email protected]> Co-authored-by: RunningLeon <[email protected]> Co-authored-by: q.yao <[email protected]> Co-authored-by: Junhwa Song <[email protected]> Co-authored-by: Chen Xin <[email protected]> Co-authored-by: Yang Nie <[email protected]> Co-authored-by: CescMessi <[email protected]> Co-authored-by: nino <[email protected]> Co-authored-by: yinfan98 <[email protected]> Co-authored-by: Renhao Liu <[email protected]> Co-authored-by: Peng Lu <[email protected]> Co-authored-by: roman-duris <[email protected]> Co-authored-by: LuukvandenBent <[email protected]> Co-authored-by: Danil <[email protected]> Co-authored-by: Mohammed Yasin <[email protected]> Co-authored-by: Yang Nie <[email protected]> Co-authored-by: Andrii-Sheba <[email protected]>

use different stream when create handle

366c84f

RunningLeon approved these changes Nov 9, 2023

View reviewed changes

RunningLeon merged commit 2882c64 into open-mmlab:main Nov 9, 2023
20 of 22 checks passed

RunningLeon added enhancement New feature or request SDK labels Nov 9, 2023

irexyc mentioned this pull request Dec 11, 2023

Could mmdeploy support multiple model inference in parallel at the same GPU (TensorRT)？ #2596

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

use different stream when create handle #2526

use different stream when create handle #2526

irexyc commented Oct 31, 2023

RunningLeon left a comment

use different stream when create handle #2526

use different stream when create handle #2526

Conversation

irexyc commented Oct 31, 2023

Motivation

Modification

RunningLeon left a comment

Choose a reason for hiding this comment