Intel/dpt-hybrid-midas

81次阅读

Intel/dpt-hybrid-midas

Model Details: DPT-Hybrid

Dense Prediction Transformer (DPT) model trained on 1.4 million images for monocular depth estimation.
It was introduced in the paper Vision Transformers for Dense Prediction by Ranftl et al. (2021) and first released in this repository.
DPT uses the Vision Transformer (ViT) as backbone and adds a neck + head on top for monocular depth estimation.

This repository hosts the “hybrid” version of the model as stated in the paper. DPT-Hybrid diverges from DPT by using ViT-hybrid as a backbone and taking some activations from the backbone.
The model card has been written in combination by the Hugging Face team and Intel.

Model Detail Description
Model Authors – Company Intel
Date December 22, 2022
Version 1
Type Computer Vision – Monocular Depth Estimation
Paper or Other Resources Vision Transformers for Dense Prediction and GitHub Repo
License Apache 2.0
Questions or Comments Community Tab and Intel Developers Discord

前往AI网址导航

正文完
 0
微草录
版权声明:本站原创文章,由 微草录 2024-01-03发表,共计811字。
转载说明:除特殊说明外本站文章皆由CC-4.0协议发布,转载请注明出处。