Jacob Marks in Voxel51 · 6 hours ago
Tunnel vision in computer vision: can ChatGPT see?
In just two weeks, ChatGPT has taken a commanding hold of the public consciousness. More than a million people have “conversed” with OpenAI’s new chatbot, asking it to write poems and college essays, generate recipe ideas, build virtual machines, and oh so much more. It’s been used to write the…
Computer Vision · 21 min read

Epoch - IIT Hyderabad · 4 hours ago
Image Denoising
Image denoising isn’t a trivial task. Our goal is to remove the noise from the image while still preserving its detail. One could use filters like Gaussian or median blur, but they don’t work so well on many kinds of images we come across…
Computer Vision · 3 min read

Konstantinos Gyftodimos · 10 hours ago
Vision Transformer for Binary Classification of Custom Dataset with PyTorch
Contents: Short description: a short description of ViT. Coding part: binary classification with ViT for a custom dataset. Appendix: ViT hyperparameters explanation. Short description: Vision transformers are among the most popular transformers in deep learning. Before vision transformers emerged, we had to use convolutional neural networks in computer…
Computer Vision · 6 min read

Matt Deitke in AI2 Blog · 1 day ago
AI2-THOR v5.0: A Major Update for Embodied AI Research
We are excited to announce the release of AI2-THOR v5.0, which includes several new features and improvements. One of the major additions in this release is the inclusion of ProcTHOR, a powerful tool for procedural generation of diverse, realistic, interactive, customizable, and performant 3D environments. This allows researchers to sample…
Computer Vision · 2 min read

Xtreme1 in Multisensory Data Training · 16 hours ago
Xtreme1, the First Open-Source Labeling & Annotation and Visualization Project, is debuting at the Linux Foundation AI & DATA Global Landscape
Introduction: Since the launch of BasicAI’s ‘Xtreme1’ on GitHub, hundreds of AI enthusiasts, students, engineers, and experts in the autonomous driving industry have contributed and rated the repository’s page. Even more, experts have joined the Xtreme1 community. …
Computer Vision · 4 min read

Luc Frachon · 11 hours ago
The Intuitive Diffusion Model (Part 1)
These days, diffusion models are the foundation of some of the most exciting machine learning applications, especially in prompt-based image generation. Many blogs explain the math and provide code examples. Here, I will try to provide some intuition behind their mathematical formulation. …
Computer Vision · 12 min read

Jacob Marks in Voxel51 · 2 days ago
Why 2022 was the most exciting year in computer vision history (so far)
The past 12 months have seen rapid advances in computer vision, from the enabling infrastructure, to new applications across industries, to algorithmic breakthroughs in research, to the explosion of AI-generated art. It would be impossible to cover all of these developments in full detail in a single blog post. …
Computer Vision · 9 min read

Peng Cao · 9 hours ago · Member-only
A Step by Step Guide to Detect Available Parking Spots in Real-Time With OpenCV
This article details how to monitor available parking spots in real time with Python and OpenCV, so that arriving drivers know whether they should enter. Let’s dive in… Overview: set a Region of Interest (ROI) for each parking spot; design logic to distinguish available and unavailable spots; apply logic…
Computer Vision · 3 min read

Xtreme1 in Multisensory Data Training · 17 hours ago
Upload Multisensory Data — Xtreme1 Tutorial Series Part 3
Xtreme1 is the world’s first open-source platform for Multisensory Training Data. After going open source, we’ve been praised by many engineers and have been inspired to share the knowledge of how to contribute to our project. Please support us by “starring” our GitHub repo: https://github.com/basicai/xtreme1 This guide is part 4…
Computer Vision · 4 min read

iomniscient · 7 hours ago
21 Years Old and Growing with our Multi-Sensory AI
We had mentioned that we had just turned 21, and we have been reviewing the 7 technology factors that we believe have contributed to our longevity. Previously we had talked about a couple of them: our Autonomous AI capability and the fact that we provided a complete service…
Computer Vision · 1 min read