
usls


A Rust library integrated with ONNXRuntime, providing a collection of Computer Vision and Vision-Language models including YOLOv5, YOLOv8, YOLOv9, YOLOv10, RTDETR, CLIP, DINOv2, FastSAM, YOLO-World, BLIP, PaddleOCR, Depth-Anything, MODNet and others.

*Example outputs: Depth-Anything · YOLOPv2 · Text Detection · Portrait Matting · YOLOv8-OBB*

Supported Models

| Model | Task / Type | Example | CUDA<br>f32 | CUDA<br>f16 | TensorRT<br>f32 | TensorRT<br>f16 |
| ----- | ----------- | ------- | ----------- | ----------- | --------------- | --------------- |
| YOLOv5 | Classification<br>Object Detection<br>Instance Segmentation | demo | | | | |
| YOLOv8 | Object Detection<br>Instance Segmentation<br>Classification<br>Oriented Object Detection<br>Keypoint Detection | demo | | | | |
| YOLOv9 | Object Detection | demo | | | | |
| YOLOv10 | Object Detection | demo | | | | |
| RTDETR | Object Detection | demo | | | | |
| FastSAM | Instance Segmentation | demo | | | | |
| YOLO-World | Object Detection | demo | | | | |
| DINOv2 | Vision-Self-Supervised | demo | | | | |
| CLIP | Vision-Language | demo | visual<br>textual | visual<br>textual | | |
| BLIP | Vision-Language | demo | visual<br>textual | visual<br>textual | | |
| DB | Text Detection | demo | | | | |
| SVTR | Text Recognition | demo | | | | |
| RTMO | Keypoint Detection | demo | | | | |
| YOLOPv2 | Panoptic Driving Perception | demo | | | | |
| Depth-Anything | Monocular Depth Estimation | demo | | | | |
| MODNet | Image Matting | demo | | | | |

Installation

Refer to the ort guide.

For Linux or macOS users:
  • First, download the latest release from ONNXRuntime Releases.
  • Then link it by setting `ORT_DYLIB_PATH`:

```shell
export ORT_DYLIB_PATH=/Users/qweasd/Desktop/onnxruntime-osx-arm64-1.17.1/lib/libonnxruntime.1.17.1.dylib
```
    

Demo

```shell
cargo run -r --example yolov8   # yolov10, blip, clip, yolop, svtr, db, yolo-world, ...
```

Integrate into your own project


1. Add `usls` as a dependency to your project's `Cargo.toml`:

```shell
cargo add usls
```

Or pin a specific commit:

```toml
usls = { git = "https://github.com/jamjamjon/usls", rev = "???sha???" }
```

2. Set `Options` and build the model:

```rust
let options = Options::default()
    .with_model("../models/yolov8m-seg-dyn-f16.onnx");
let mut model = YOLO::new(options)?;
```
  • If you want to run your model with TensorRT or CoreML:

```rust
let options = Options::default()
    .with_trt(0); // switch to TensorRT; CUDA is used by default
    // .with_coreml(0) // or use CoreML instead
```
  • If your model has dynamic shapes:

```rust
let options = Options::default()
    .with_i00((1, 2, 4).into())         // dynamic batch
    .with_i02((416, 640, 800).into())   // dynamic height
    .with_i03((416, 640, 800).into());  // dynamic width
```
  • If you want to set a confidence threshold for each class:

```rust
let options = Options::default()
    .with_confs(&[0.4, 0.15]); // class 0: 0.4, others: 0.15
```

  • See `Options` for more model options.
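The `(1, 2, 4)`-style triples above appear to be (min, opt, max) profiles as used for dynamic shapes, and the threshold list falls back to its last entry for unlisted classes. A minimal self-contained sketch of both conventions — the `MinOptMax` type and `conf_for_class` helper here are illustrative stand-ins, not the usls API:

```rust
/// Illustrative (min, opt, max) profile for one dynamic dimension,
/// mirroring triples like (1, 2, 4) or (416, 640, 800) above.
/// Not the usls API — a self-contained sketch.
#[derive(Clone, Copy)]
struct MinOptMax {
    min: usize,
    opt: usize,
    max: usize,
}

impl MinOptMax {
    /// Use the optimal size when none is requested; otherwise clamp
    /// the request into the profile's [min, max] range.
    fn resolve(&self, requested: Option<usize>) -> usize {
        requested.map_or(self.opt, |n| n.clamp(self.min, self.max))
    }
}

/// Per-class confidence lookup following the convention above:
/// one threshold per listed class, with the last entry acting as
/// the default for all remaining classes.
fn conf_for_class(confs: &[f32], class_id: usize) -> f32 {
    confs
        .get(class_id)
        .copied()
        .unwrap_or_else(|| *confs.last().expect("confs must be non-empty"))
}

fn main() {
    let height = MinOptMax { min: 416, opt: 640, max: 800 };
    assert_eq!(height.resolve(None), 640);       // optimal size
    assert_eq!(height.resolve(Some(1024)), 800); // clamped to max

    let confs = [0.4, 0.15];
    assert_eq!(conf_for_class(&confs, 0), 0.4);  // class 0
    assert_eq!(conf_for_class(&confs, 7), 0.15); // any other class
}
```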

3. Prepare inputs, and then you're ready to go:

  • Build a `DataLoader` to load images:

```rust
let dl = DataLoader::default()
    .with_batch(model.batch.opt as usize)
    .load("./assets/")?;

for (xs, _paths) in dl {
    let _y = model.run(&xs)?;
}
```

  • Or simply read one image:

```rust
let x = vec![DataLoader::try_read("./assets/bus.jpg")?];
let y = model.run(&x)?;
```
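Batched iteration like the loop above can be sketched with a plain chunking helper. The `batches` function here is hypothetical, for illustration only — it is not the usls `DataLoader`:

```rust
/// Split a list of inputs into fixed-size batches, the way
/// `DataLoader::with_batch` groups images before inference.
/// Hypothetical helper for illustration — not the usls DataLoader.
fn batches<T: Clone>(items: &[T], batch_size: usize) -> Vec<Vec<T>> {
    items
        .chunks(batch_size.max(1)) // guard against a zero batch size
        .map(|chunk| chunk.to_vec())
        .collect()
}

fn main() {
    let paths = ["a.jpg", "b.jpg", "c.jpg", "d.jpg", "e.jpg"];
    let bs = batches(&paths, 2);
    assert_eq!(bs.len(), 3);          // 2 + 2 + 1 images
    assert_eq!(bs[2], vec!["e.jpg"]); // last batch may be smaller
}
```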

4. Annotate and save results:

```rust
let annotator = Annotator::default().with_saveout("YOLOv8");
annotator.annotate(&x, &y);
```

5. Get results:

The inference outputs of the provided models are returned as a `Vec<Y>`:

```rust
pub struct Y {
    probs: Option<Prob>,
    bboxes: Option<Vec<Bbox>>,
    keypoints: Option<Vec<Vec<Keypoint>>>,
    mbrs: Option<Vec<Mbr>>,
    polygons: Option<Vec<Polygon>>,
    texts: Option<Vec<String>>,
    masks: Option<Vec<Mask>>,
    embedding: Option<Embedding>,
}
```
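Every field is an `Option` because not every task produces every kind of output. A minimal sketch of that accessor pattern — `Output` here is a simplified stand-in, not the real `Y` type:

```rust
/// Simplified stand-in for the Y output container above — not the
/// real usls type. Fields are Option because a given task may not
/// produce that kind of output at all.
#[derive(Default)]
struct Output {
    texts: Option<Vec<String>>,
}

impl Output {
    /// Borrowing accessor in the style of `y.bboxes()`:
    /// returns None when the task produced no texts.
    fn texts(&self) -> Option<&[String]> {
        self.texts.as_deref()
    }
}

fn main() {
    let y = Output { texts: Some(vec!["hello".to_string()]) };
    assert_eq!(y.texts().map(|t| t.len()), Some(1));
    assert!(Output::default().texts().is_none());
}
```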
  • You can get detection bboxes with `y.bboxes()`:

```rust
let ys = model.run(&xs)?;
for y in ys {
    // bboxes
    if let Some(bboxes) = y.bboxes() {
        for bbox in bboxes {
            println!(
                "Bbox: {}, {}, {}, {}, {}, {}",
                bbox.xmin(),
                bbox.ymin(),
                bbox.xmax(),
                bbox.ymax(),
                bbox.confidence(),
                bbox.id(),
            );
        }
    }
}
```

    More `Bbox` methods here: src/ys/bbox.rs

  • Results for other tasks can be found at: src/ys/y.rs
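With the xyxy accessors shown above, typical post-processing such as computing the IoU of two boxes can be sketched as follows. `Box4` is an illustrative stand-in, not the usls `Bbox` type:

```rust
/// Axis-aligned box as (xmin, ymin, xmax, ymax), matching the
/// accessors shown above. Illustrative sketch — not the usls Bbox.
#[derive(Clone, Copy)]
struct Box4 {
    xmin: f32,
    ymin: f32,
    xmax: f32,
    ymax: f32,
}

impl Box4 {
    /// Area, clamped to zero for degenerate boxes.
    fn area(&self) -> f32 {
        (self.xmax - self.xmin).max(0.0) * (self.ymax - self.ymin).max(0.0)
    }

    /// Intersection-over-union of two boxes, in [0, 1].
    fn iou(&self, other: &Box4) -> f32 {
        let inter = Box4 {
            xmin: self.xmin.max(other.xmin),
            ymin: self.ymin.max(other.ymin),
            xmax: self.xmax.min(other.xmax),
            ymax: self.ymax.min(other.ymax),
        }
        .area();
        let union = self.area() + other.area() - inter;
        if union > 0.0 { inter / union } else { 0.0 }
    }
}

fn main() {
    let a = Box4 { xmin: 0.0, ymin: 0.0, xmax: 2.0, ymax: 2.0 };
    let b = Box4 { xmin: 1.0, ymin: 1.0, xmax: 3.0, ymax: 3.0 };
    // overlap is 1x1 = 1, union is 4 + 4 - 1 = 7
    assert!((a.iou(&b) - 1.0 / 7.0).abs() < 1e-6);
}
```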

Solution Models

Additionally, this repo provides some solution models.
| Model | Example | Result |
| ----- | ------- | ------ |
| Lane Line Segmentation<br>Drivable Area Segmentation<br>Car Detection | demo | |
| Face Parsing | demo | |
| Text Detection<br>(PPOCR-det v3, v4) | demo | |
| Text Recognition (Chinese & English)<br>(PPOCR-rec v3, v4) | demo | |
| Face-Landmark Detection | demo | |
| Head Detection | demo | |
| Fall Detection | demo | |
| Trash Detection | demo | |