Back to portfolio

research

Colab CV/DL Prototype Archive

Public notebook-style CV/DL prototype archive for Swin/CvT starters, OCR finetuning, Android document capture, video search, lip sync, and CLIP media experiments.

Overview

Colab CV/DL Prototype Archive groups public notebook-style repositories and Colab-ready code that show research range across image classification, OCR finetuning, mobile document capture, video retrieval, lip-sync media generation, and CLIP-based creative tooling. The archive is intentionally scoped as prototype and research context: it links only public GitHub repositories and avoids unpublished notebooks, service endpoints, or restricted datasets.

What It Covers

  • Groups older public notebooks into a coherent CV/DL research surface for agents and recruiters
  • Covers image classification, OCR finetuning, document capture, multimodal video search, and generative media
  • Keeps notebook references tied to public GitHub repos and generated case studies
  • Labels the work as prototypes so agents do not confuse notebooks with maintained production services

Stack And Topics

  • Jupyter Notebook
  • Google Colab
  • PyTorch
  • TensorFlow
  • Swin Transformer
  • CvT
  • CLIP
  • MMOCR
  • OpenCV
  • CameraX

Public Signals

  • Public prototype links: 8 GitHub API and repo URL review, 2026-05-14
  • Research families: 5 classification, OCR, mobile capture, video retrieval, generative media
  • Notebook posture: prototype not presented as live service or production accuracy claim
  • Source links: public-only GitHub repos and generated case studies only

References