
usls


A Rust library integrated with ONNXRuntime, providing a collection of Computer Vision and Vision-Language models, including YOLOv5, YOLOv8, YOLOv9, YOLOv10, RTDETR, CLIP, DINOv2, FastSAM, YOLO-World, BLIP, PaddleOCR, Depth-Anything, MODNet, and others.

Demos: Monocular Depth Estimation · Panoptic Driving Perception · Text Detection & Recognition · Portrait Matting

Supported Models

| Model | Task / Type | Example |
|---|---|---|
| YOLOv5 | Classification, Object Detection, Instance Segmentation | demo |
| YOLOv6 | Object Detection | demo |
| YOLOv7 | Object Detection | demo |
| YOLOv8 | Object Detection, Instance Segmentation, Classification, Oriented Object Detection, Keypoint Detection | demo |
| YOLOv9 | Object Detection | demo |
| YOLOv10 | Object Detection | demo |
| RTDETR | Object Detection | demo |
| FastSAM | Instance Segmentation | demo |
| YOLO-World | Object Detection | demo |
| DINOv2 | Vision Self-Supervised | demo |
| CLIP | Vision-Language (visual, textual) | demo |
| BLIP | Vision-Language (visual, textual) | demo |
| DB | Text Detection | demo |
| SVTR | Text Recognition | demo |
| RTMO | Keypoint Detection | demo |
| YOLOPv2 | Panoptic Driving Perception | demo |
| Depth-Anything (v1, v2) | Monocular Depth Estimation | demo |
| MODNet | Image Matting | demo |

Installation

Refer to the ort docs.

For Linux or macOS users:
  • Download a prebuilt package from ONNXRuntime Releases
  • Then link by pointing ORT_DYLIB_PATH at the downloaded shared library:
    export ORT_DYLIB_PATH=/Users/qweasd/Desktop/onnxruntime-osx-arm64-1.17.1/lib/libonnxruntime.1.17.1.dylib
    
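On Linux the setup is analogous; a sketch with a hypothetical download location (adjust the version, architecture, and path to the release you actually fetched):

```shell
# Hypothetical paths: point ORT_DYLIB_PATH at the shared library you downloaded
ORT_DIR="$HOME/onnxruntime-linux-x64-1.17.1"
export ORT_DYLIB_PATH="$ORT_DIR/lib/libonnxruntime.so.1.17.1"
echo "$ORT_DYLIB_PATH"
```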

Quick Start

cargo run -r --example yolo   # blip, clip, yolop, svtr, db, ...

Integrate into your own project

1. Add usls as a dependency to your project's Cargo.toml

cargo add usls

Or pin a specific commit:

usls = { git = "https://github.com/jamjamjon/usls", rev = "???sha???"}

2. Build model

let options = Options::default()
    .with_yolo_version(YOLOVersion::V5)  // YOLOVersion: V5, V6, V7, V8, V9, V10, RTDETR
    .with_yolo_task(YOLOTask::Classify)  // YOLOTask: Classify, Detect, Pose, Segment, Obb
    .with_model("xxxx.onnx")?;
let mut model = YOLO::new(options)?;
  • If you want to run your model with TensorRT or CoreML:

    let options = Options::default()
        .with_trt(0)        // use TensorRT on device 0 (CUDA is the default backend)
        // .with_coreml(0)  // or CoreML
    
  • If your model has dynamic shapes

    let options = Options::default()
        .with_i00((1, 2, 4).into()) // dynamic batch
        .with_i02((416, 640, 800).into())   // dynamic height
        .with_i03((416, 640, 800).into())   // dynamic width
    
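The (min, opt, max) triples above bound each dynamic dimension. Conceptually, a runtime input size must land inside that range; a plain-Rust illustration of the idea (the `MinOptMax` name and `resolve` method here are hypothetical, not the usls implementation):

```rust
// Hypothetical stand-in for a (min, opt, max) dynamic-dimension spec.
struct MinOptMax {
    min: usize,
    opt: usize, // preferred size, e.g. used to build the TensorRT profile
    max: usize,
}

impl MinOptMax {
    // Clamp a requested size into the allowed [min, max] range.
    fn resolve(&self, requested: usize) -> usize {
        requested.clamp(self.min, self.max)
    }
}

fn main() {
    // Mirrors `.with_i02((416, 640, 800).into())` above.
    let height = MinOptMax { min: 416, opt: 640, max: 800 };
    println!("preferred: {}", height.opt);
    println!("resolve(1000) = {}", height.resolve(1000)); // clamped to 800
    println!("resolve(512)  = {}", height.resolve(512));  // within range: 512
}
```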
  • If you want to set a confidence threshold for each category:

    let options = Options::default()
        .with_confs(&[0.4, 0.15]) // class_0: 0.4, others: 0.15
    
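The per-class threshold rule ("class_0: 0.4, others: 0.15") can be sketched in plain Rust. The `Det` struct and `passes` helper are hypothetical illustrations, and the fallback-to-last-threshold rule is an assumption drawn from the comment above, not usls API:

```rust
// Hypothetical detection record: class id plus confidence score.
struct Det {
    class_id: usize,
    conf: f32,
}

// Keep a detection if it passes its class threshold; classes beyond the
// slice fall back to the last threshold given (assumed behavior).
fn passes(det: &Det, confs: &[f32]) -> bool {
    let thr = confs
        .get(det.class_id)
        .or(confs.last())
        .copied()
        .unwrap_or(0.0);
    det.conf >= thr
}

fn main() {
    let confs = [0.4, 0.15]; // class_0: 0.4, others: 0.15
    let dets = [
        Det { class_id: 0, conf: 0.30 }, // below 0.4  -> dropped
        Det { class_id: 5, conf: 0.20 }, // above 0.15 -> kept
    ];
    for d in &dets {
        println!("class {} kept: {}", d.class_id, passes(d, &confs));
    }
}
```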
  • See Options for the full set of model options.

3. Load images

  • Build a DataLoader to load images in batches
let dl = DataLoader::default()
    .with_batch(model.batch.opt as usize)
    .load("./assets/")?;

for (xs, _paths) in dl {
    let _y = model.run(&xs)?;
}
  • Or simply read one image
let x = vec![DataLoader::try_read("./assets/bus.jpg")?];
let y = model.run(&x)?;
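The DataLoader's batched iteration above can be pictured as chunking the image list, one chunk per `(xs, _paths)` step. A plain-Rust sketch of the idea, not the DataLoader implementation:

```rust
fn main() {
    // Pretend these are loaded images; the loader yields them in batches.
    let images = vec!["bus.jpg", "dog.jpg", "cat.jpg", "kids.jpg", "car.jpg"];
    let batch = 2;

    // Each chunk corresponds to one `(xs, _paths)` iteration of the loop above.
    for (i, xs) in images.chunks(batch).enumerate() {
        println!("batch {i}: {xs:?}"); // the last batch may be smaller
    }
}
```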

4. Annotate and save

let annotator = Annotator::default().with_saveout("YOLO");
annotator.annotate(&x, &y);

5. Get results

The inference outputs of the provided models are returned as a Vec<Y>.

  • You can get detection bboxes with y.bboxes():

    let ys = model.run(&xs)?;
    for y in ys {
        // bboxes
        if let Some(bboxes) = y.bboxes() {
            for bbox in bboxes {
                println!(
                    "Bbox: {}, {}, {}, {}, {}, {}",
                    bbox.xmin(),
                    bbox.ymin(),
                    bbox.xmax(),
                    bbox.ymax(),
                    bbox.confidence(),
                    bbox.id(),
                );
            }
        }
    }
    
  • For everything else, see the Docs.
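Downstream code often needs box width/height or overlap computed from the corner coordinates shown above. A self-contained helper sketch, independent of the usls types (the `Box2D` struct is hypothetical and only mirrors the `xmin`/`ymin`/`xmax`/`ymax` accessors):

```rust
// Axis-aligned box in (xmin, ymin, xmax, ymax) corner form.
#[derive(Clone, Copy)]
struct Box2D {
    xmin: f32,
    ymin: f32,
    xmax: f32,
    ymax: f32,
}

impl Box2D {
    fn area(&self) -> f32 {
        (self.xmax - self.xmin).max(0.0) * (self.ymax - self.ymin).max(0.0)
    }

    // Intersection-over-union between two boxes, in [0, 1].
    fn iou(&self, other: &Box2D) -> f32 {
        let inter = Box2D {
            xmin: self.xmin.max(other.xmin),
            ymin: self.ymin.max(other.ymin),
            xmax: self.xmax.min(other.xmax),
            ymax: self.ymax.min(other.ymax),
        };
        let i = inter.area();
        i / (self.area() + other.area() - i)
    }
}

fn main() {
    let a = Box2D { xmin: 0.0, ymin: 0.0, xmax: 10.0, ymax: 10.0 };
    let b = Box2D { xmin: 5.0, ymin: 0.0, xmax: 15.0, ymax: 10.0 };
    println!("IoU = {:.3}", a.iou(&b)); // overlap 50, union 150
}
```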
