You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

UnCageNet

UnCageNet is a computer vision framework for robust animal tracking and pose estimation in caged environments, where occlusions caused by cage bars significantly degrade the performance of existing methods.

This repository provides the official implementation of the paper:

UnCageNet: Tracking and Pose Estimation of Caged Animal
Sayak Dutta, Harish Katti, Shashikant Verma, Shanmuganathan Raman
arXiv: https://arxiv.org/abs/2512.07712

πŸ”— Code: https://github.com/itz-sayak/UnCageNet


πŸ” Method Overview

UnCageNet introduces a three-stage preprocessing pipeline that improves downstream tracking and pose estimation under structured occlusions:

  1. Cage Segmentation

    • Gabor-enhanced ResNet-UNet
    • Orientation-aware filters (72 directional kernels)
    • Accurate detection of cage bar structures
  2. Cage Inpainting

    • Content-aware reconstruction using CRFill
    • Removes structured occlusions while preserving animal appearance
  3. Downstream Evaluation

    • Standard pose estimation and tracking models (e.g., STEP, ViTPose)
    • Applied on β€œuncaged” frames for fair performance comparison

This pipeline enables performance comparable to uncaged environments, despite heavy occlusions.


πŸ“Š Experimental Highlights

  • Significant improvement in:
    • Keypoint detection accuracy
    • Trajectory consistency
  • Robust performance across:
    • Severe occlusion patterns
    • Long video sequences
  • Plug-and-play compatibility with existing tracking and pose models

(Refer to the paper for full quantitative results.)


πŸ’‘ Intended Use

UnCageNet is intended for:

  • Animal behavior analysis
  • Zoological and veterinary monitoring
  • Laboratory animal studies
  • Long-term tracking in constrained environments

⚠️ Limitations

  • Assumes structured occlusions (e.g., cage bars)
  • Performance may degrade for:
    • Highly deformable or unstructured occluders
    • Extremely low-resolution video
  • Not trained for arbitrary object categories beyond animals

πŸ“„ Citation

If you use this work, please cite:

@article{dutta2025uncagenet,
  title   = {UnCageNet: Tracking and Pose Estimation of Caged Animal},
  author  = {Dutta, Sayak and Katti, Harish and Verma, Shashikant and Raman, Shanmuganathan},
  journal = {arXiv preprint arXiv:2512.07712},
  year    = {2025}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support