Welcome to CVNets documentation!

CVNets is an open-source library for training deep neural networks for visual recognition tasks, including classification, detection, and segmentation.

CVNets supports image and video understanding tools, including data loading, data transformations, novel data sampling methods, and implementations of several state-of-the-art networks.

Our source code is available on Github .

Citation

If you find CVNets useful, please cite the following papers:

@inproceedings{mehta2022mobilevit,
    title={MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer},
    author={Sachin Mehta and Mohammad Rastegari},
    booktitle={International Conference on Learning Representations},
    year={2022}
}

@inproceedings{mehta2022cvnets,
    author = {Mehta, Sachin and Abdolhosseini, Farzad and Rastegari, Mohammad},
    title = {CVNets: High Performance Library for Computer Vision},
    year = {2022},
    booktitle = {Proceedings of the 30th ACM International Conference on Multimedia},
    series = {MM '22}
}

Indices and tables