• IEEE CS Standards
  • Career Center
  • Subscribe to Newsletter
  • IEEE Standards

computer vision research papers ieee

  • For Industry Professionals
  • For Students
  • Launch a New Career
  • Membership FAQ
  • Membership FAQs
  • Membership Grades
  • Special Circumstances
  • Discounts & Payments
  • Distinguished Contributor Recognition
  • Grant Programs
  • Find a Local Chapter
  • Find a Distinguished Visitor
  • Find a Speaker on Early Career Topics
  • Technical Communities
  • Collabratec (Discussion Forum)
  • Start a Chapter
  • My Subscriptions
  • My Referrals
  • Computer Magazine
  • ComputingEdge Magazine
  • Let us help make your event a success. EXPLORE PLANNING SERVICES
  • Events Calendar
  • Calls for Papers
  • Conference Proceedings
  • Conference Highlights
  • Top 2024 Conferences
  • Conference Sponsorship Options
  • Conference Planning Services
  • Conference Organizer Resources
  • Virtual Conference Guide
  • Get a Quote
  • CPS Dashboard
  • CPS Author FAQ
  • CPS Organizer FAQ
  • Find the latest in advanced computing research. VISIT THE DIGITAL LIBRARY
  • Open Access
  • Tech News Blog
  • Author Guidelines
  • Reviewer Information
  • Guest Editor Information
  • Editor Information
  • Editor-in-Chief Information
  • Volunteer Opportunities
  • Video Library
  • Member Benefits
  • Institutional Library Subscriptions
  • Advertising and Sponsorship
  • Code of Ethics
  • Educational Webinars
  • Online Education
  • Certifications
  • Industry Webinars & Whitepapers
  • Research Reports
  • Bodies of Knowledge
  • CS for Industry Professionals
  • Resource Library
  • Newsletters
  • Women in Computing
  • Digital Library Access
  • Organize a Conference
  • Run a Publication
  • Become a Distinguished Speaker
  • Participate in Standards Activities
  • Peer Review Content
  • Author Resources
  • Publish Open Access
  • Society Leadership
  • Boards & Committees
  • Special Technical Communities
  • Local Chapters
  • Governance Resources
  • Conference Publishing Services
  • Chapter Resources
  • About the Board of Governors
  • Board of Governors Members
  • Diversity & Inclusion
  • Open Volunteer Opportunities
  • Award Recipients
  • Student Scholarships & Awards
  • Nominate an Election Candidate
  • Nominate a Colleague
  • Corporate Partnerships
  • Conference Sponsorships & Exhibits
  • Advertising
  • Recruitment
  • Publications
  • Education & Career

Resources for Computer Vision Professionals

With the ever-growing interest in computer vision, the research, applications, and commercial possibilities for this technology are immense. discover how the world of computer vision is evolving and explore the career opportunities that are newly emerging., page content:, what is computer vision, the fundamentals of computer vision, where is computer vision headed, transportation & aviation, security & privacy, entertainment, agriculture, career opportunities, computer vision engineers, xr design/graphics engineers, data visualization engineers, challenges and limitations of computer vision technology, ethics, standards, diversity, and inclusion, ethics in computer vision, standards & inclusion in xr, diversity in visualization research, voices from the community, ieee computer society fellow: greg welch.

  • No results found.

On this resource page you’ll learn…

  • Foundations of Computer Vision: Understand the core principles of computer vision and gain insights into how these systems work.
  • Market Projections: Gain insight into the anticipated growth of the computer vision market, set to exceed USD $20.88 billion by 2030, with impacts on key domains such as transportation , healthcare , security , entertainment , and agriculture .
  • Opportunities in Research and Development: Learn about the increasing demand for research and development in the expanding landscape of computer vision, and discover the rising job opportunities within this dynamic field.
  • Industry Impact and Challenges : Uncover the transformative effects of computer vision across various sectors, while acknowledging the existing limitations and barriers that require attention.
  • Ethical Considerations: Examine the ethical concerns of computer vision, including the pressing need for transparency, fairness, accountability, privacy, and the adoption of best practices to ensure responsible deployment.

Back to Top

“‘Intelligent’ computers require knowledge of their environment, and the most effective means of acquiring such knowledge is by seeing. Vision opens a new realm of computer applications,” Computer magazine, May 1973.

Grounded in the principles of artificial intelligence (AI), computer vision provides machines the capability to perceive and analyze visual data such as images, graphics, and videos. The intention is similar to AI — to automate decisions — yet its area of focus is exclusive to activities a human’s visual system would generally conduct. IBM describes the contrast lucidly: “If AI enables computers to think, computer vision enables them to see, observe, and understand.”

Computer vision, which seems like a modern innovation, is the outcome of extensive research stretching back to the 1960s. First coming into discovery with Seymour Papert’s Summer Vision Project of 1966, computer vision has been in development for decades, improving all along the way and creating new possibilities for everyone. Though complex, the process of these systems can be broken down into four fundamental steps:

  • Visual data such as images or video is taken into the computer vision systems as input. Since images are made up of pixels, these machines process information at the pixel level.
  • To analyze the data, distinctive features in the image, such as contours, corners, or colors, are identified using algorithms and models.
  • Through the process of identification, the computer recognizes objects such as people, as well as certain behaviors in the visuals. With the powers of machine learning, the computer can improve this ability over time.
  • Finally, the computer can provide an output based on this interpretation. To be put simply, this is when the computer communicates what it’s seeing.

Before the technology of computer vision came to today’s application methods, there were of course key pioneers that led the way first. For example, the Optical Character Recognition system was developed by Ray Kurzweil of Kurzweil Computer Products, Inc. in 1974. This system could recognize and process printed text, no matter the font and without manual entry. When placed in a machine learning format and enhanced with text-to-speech features, the technology was used to read for the blind.

This is just one pivotal example of the many applications that display the power and impact of computer vision. Thanks to waves of developments and crucial research, the technology has improved several domains of human life including transportation, healthcare, security, entertainment, and agriculture. Because of this, it is no surprise that the market of computer vision is expected to expand in the very near future.

According to the Top Trends in Computer Vision Report , which reviews the latest trends covered at the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , the computer vision industry raked in over $12.14 billion USD in 2022 and has a 7% projected growth rate with $20.88 billion USD expected by 2030.

The revenue is projected to increase due to the surging need for the technology in various fields, like transportation, healthcare, and security. Moreover, according to PS Market Research , XR entertainment systems which were worth $38.3 billion in 2022 are predicted to reach an immense value of $394.8 billion by 2030.

Discover the Future of Computer Vision at IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

  • The U.S. National Highway Traffic Safety Administration (NHTSA) has reported that 94% of critical collisions are caused by human error. With the help of computer vision, advanced cameras and sensors allow vehicles to analyze surroundings, detect objects such as pedestrians and other vehicles, and safely navigate around them. Furthermore, the technology is also used within the aviation sector to create flight simulators. Within these sectors, Extended Reality (XR) is also used to simulate flight training while reducing costs, time, and possible damages to aircraft.
  • Toward Fully Autonomous and Networked Vehicles
  •  Autonomous Driving Technologies Special Technical Community
  • Using Extended Reality in Flight Simulators: A Literature Review

Learn more about computer vision and automated vehicles by taking the IEEE course on ‘Using Machine Vision Perception to Control Automated Vehicle Maneuvering’

  • Computer vision is also the technology to thank for an improved patient experience within the healthcare system. This includes medical treatments and procedures. Specifically, computer vision has transformed the capabilities of medical imaging data , which allows practitioners to diagnose, monitor, or treat medical conditions. The technology also permits augmented reality (AR)-assisted surgical guidance , which can visualize human anatomy and aid practitioners when performing operations such as neurosurgical procedures.
  • AR-Assisted Surgical Guidance System for Ventriculostomy
  • Augmented and Virtual Reality in Surgery
  • Standardizing 3D Medical Imaging
  • Driven by progress made within machine learning, edge computing, IoT, and AI, computer vision enables the capability to mitigate security threats in real time. For example, with the help of image processing and statistical pattern recognition, biometrics allow computers to recognize persons based on physiological characteristics, such as faces or fingerprints. Additionally, computer vision aids security within smart security surveillance . This includes cameras that are placed in different areas within a city that monitor and detect threatening behavior. Attracting more attention is privacy-preserving biometrics as it may be used to resolve concerns related to cryptographic authentication processes.
  • The Interplay of AI and Biometrics: Challenges and Opportunities
  • Biometrics and Privacy-Preservation: How Do They Evolve?
  • Biometrics Based Access Framework for Secure Cloud Computing

XR gaming blurs the line between virtual and physical realities, simulating new worlds and adventures for players to be fully immersed within. According to XR Today , the technology has provided the capability to transform social gatherings by giving its users the ability to create virtual events and exhibitions anywhere at any time.

  • Virtual Reality: A Journey from Vision to Commodity
  • Affective Virtual Reality: How to Design Artificial Experiences Impacting Human Emotions

Learn More About Virtual Reality and its Applications at IEEE VR 2024

  • According to researchers, insects affect 35% of farmland. Understanding and monitoring how insects play a role in agriculture is vital for food production, however, can be very labor-intensive and may even be unreliable at times. Computer vision can potentially improve this process by monitoring it automatically. On top of that, computer vision offers the opportunity to give automated machine systems ‘eyes’, enabling them to navigate fields, without manual labor.
  • Towards Computer Vision and Deep Learning Facilitated Pollination Monitoring for Agriculture
  • The 1st Agriculture-Vision Challenge: Methods and Results
  • Agriculture-Vision: A Large Aerial Image Database for Agricultural Pattern Analysis

According to the US Bureau of Labor Statistics , the employment of professionals in the computer and information science industry is expected to increase significantly over the next decade, reaching a 21% rise by 2031. To fill these new roles, experts in computer vision, extended reality (XR), and data visualization will be needed.

  • Computer vision engineers work in highly collaborative environments, usually guided by the needs of their clients. In addition to building architectures and using algorithms, their typical areas of expertise include image classification, face detection, pose estimation, and optical flow . Within this field, time is mainly spent developing models, retraining them, and creating reliable datasets.
  • Skills: Developing image analysis algorithms, deep learning architectures, image processing and visualization, computer vision libraries, and data flow programming Salary: $160K USD (This is a salary estimation for United States employees according to talent.com . View estimates for other countries via Salary Expert .)
  • Degree: Bachelor’s in mathematics, computer vision, computer science, machine learning, information systems
  • IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)
  • IEEE/CVF International Conference on Computer Vision
  • Technical Community on Pattern Analysis and Machine Intelligence
  • Those within the XR industry, such as XR Design/Graphics Engineers , use their knowledge of computer vision to bring creative projects to life. Furthermore, they research and develop technology that augments reality, re-creates real-life environments, or generates other spaces that users can interact with virtually. Working cross functionally with creative teams, they use their knowledge within computer vision to help aid the design, optimization, integration, and testing of XR devices and products such as video games and other entertainment systems.
  • Skills: 3D visualization tools/art, coding languages such as python, C/C++ programming, and/or Java, Linear algebra, multimedia software stacks and frameworks
  • Salary: $107,000 USD (This is a salary estimation for United States employees according to circuitstream.com . View estimates for other countries via Salary Expert .)
  • Degree: Bachelor’s in Computer engineering, mathematics, or related fields of study. Master’s in Human Centered Design and Engineering or Interaction Design
  • IEEE Virtual Reality 2024
  • Technical Community on Intelligent Informatics
  • The power of visualizing data helps decision makers to recognize and address patterns and mistakes in their information, allowing them to make educated choices for their organization. Data visualization engineers create visual representations of data, then build dashboards for different business departments to inspect. They play a pivotal role in the process of informed decision-making.
  • Skills: Business Intelligence (BI) tools, Data analysis, python-based visualizations, Data Visualization Tools such as Tableau, Yellowfin, and Qlik Sense, and mathematics/statistics
  • Salary: $96,317 (This is a salary estimation for United States employees according to salary.com . View estimates for other countries via Salary Expert .)
  • Degree: bachelor’s degree in computer science, computer information systems, software engineering, or a closely related field. Master’s degree in Data Analytics or Visualization
  • IEEE VIS: Visualization & Visual Analytics
  • Technical Community on Visualization and Graphics

While computer vision has made significant improvements, challenges still prevail, emphasizing the necessity for continuous research and development in the field. This includes concerns related to data quality and bias. It’s important to note that any technology created or managed by humans is susceptible to biases. To ensure accurate detections and optimal functionality, these systems must be developed with diversity in inputs.

Moreover, the question remains: Can a computer not only perceive but truly comprehend its observations? It is crucial to instill trust in these systems, ensuring they understand what they observe with minimal errors and increased adoption to be accurate.

Lastly, security and privacy stand as major considerations for any widely adapted technology. However, these aspects continue to be challenging with room for improvement. In the context of facial recognition, this issue becomes particularly pronounced and ongoing, necessitating scrutiny and improvement.

As the usage of computer vision technology progresses, ethics considerations have begun dominating the discussion. It’s crucial to examine specifics related to computer vision rather than depending on the general ethics linked to AI. These conversations are taking place during conferences, standards development and working groups, and research projects.

The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) aims to initiate further discussion within computer vision applications and research. In 2022, it was encouraged that researchers submit papers and proposals including potential negative societal impacts of their proposed research and possible methods on how to mitigate them. Potential ethical concerns include the safety of living beings, privacy, environmental impact, and economic security.

The organizers prioritized transparency and stated, “Grappling with ethics is a difficult problem for the field [computer vision], and thinking about ethics is still relatively new to many authors… In certain cases, it will not be possible to draw a bright line between ethical and unethical.”

The committee of IEEE/CVF CVPR 2023 planed to continue this conversation for the next annual conference and called for papers that focus on transparency, fairness, accountability, privacy, and ethics in vision.

Specifically, in regard to ethics for XR, IEEE is laying down the foundation with standardization. As stated in IEEE Spectrum , “… the IEEE Standards Association (IEEE SA) is working to help define, develop, and deploy the technologies, applications, and governance practices needed to help turn metaverse concepts into practical realities, and to drive new markets.”

It’s also vital to keep in mind that this cutting-edge technology should be made accessible. For instance, it needs to accommodate people who are visually impaired . The study “ Toward inclusivity: Virtual Reality Museums for the Visually Impaired ” examines how narrations, spatialized “reference” audio, along with haptic feedback can be an effective replacement for the traditional use of vision in a virtual reality. The study discovered that those with visual impairments could locate objects more quickly with the aid of enhanced audio and tactile feedback.

Lastly, IEEE Transactions on Visualization and Computer Graphics ( IEEE TVCG ) conducted an analysis of gender representation among the attendees, organizers, and presenters at the IEEE Visualization (VIS) conference over the last 30 years. It was found that the proportion of female authors has increased from 9% in the first five years to 22% in the last five years of the conference.

The IEEE Computer Society urges academics and practitioners to send any ideas that may advance the dialogue to [email protected] since, it is efforts such as these, that have the potential to push the industry towards a brighter future.

IEEE Computer Society Fellow and computer scientist engineer, Greg Welch, is the AdventHealth Endowed Chair in Healthcare Simulation in UCF’s College of Nursing in addition to being co-director of the UCF Synthetic Reality Laborator y. In 2021, Welch reached fellowship status, for contributions to tracking methods in augmented reality applications . Specifically, his primary area of study is virtual reality (VR) and augmented reality (AR), collectively known as “XR,” with a focus in both hardware and software applications.

Currently, Welch spends his time researching the way humans perceive AR related experiences when interacting with the technology. Additionally, he is the lead of the pending NSF project, “Virtual Experience Research Accelerator (VERA),” a system that will improve the process of generating VR related research for scientists.

When asked what advice Welch had for readers with an interest in pursuing a similar path, he mentioned how beneficial ongoing exploration can be, “The field changes fast — something that is hot today might not be tomorrow. In addition, a broader perspective can enable one to see connections and opportunities.”

He recommends taking advantage of community resources and networking opportunities, “From an experiential perspective, get involved! The community [IEEE Computer Society] would not exist without volunteers, but there are so many benefits — it really is true that you get out what you put in.”

Inside the Computer Society

computer vision research papers ieee

Expo and Leadership Forum | 27-28 August 2024

computer vision research papers ieee

Our Commitment to equity, diversity, and inclusion

computer vision research papers ieee

CS Members can now add full CSDL access for one flat rate! Use promo code CSDLTRACK

computer vision research papers ieee

Software Engineering Radio: The Podcast for Professional Software Developers

computer vision research papers ieee

Sign up for our newsletter.

EMAIL ADDRESS

computer vision research papers ieee

IEEE COMPUTER SOCIETY

  • Board of Governors
  • IEEE Support Center

DIGITAL LIBRARY

  • Librarian Resources

COMPUTING RESOURCES

  • Courses & Certifications

COMMUNITY RESOURCES

  • Conference Organizers
  • Communities

BUSINESS SOLUTIONS

  • Conference Sponsorships & Exhibits
  • Digital Library Institutional Subscriptions
  • Accessibility Statement
  • IEEE Nondiscrimination Policy
  • XML Sitemap

©IEEE — All rights reserved. Use of this website signifies your agreement to the IEEE Terms and Conditions.

A not-for-profit organization, the Institute of Electrical and Electronics Engineers (IEEE) is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity.

CVF

CVF Sponsored Conferences

Cvf sponsored conferences errata.

It is the policy of the Computer Vision Foundation to maintain PDF copies of conference papers as submitted during the camera-ready paper collection. These papers are considered the final published versions of the work. We recognize the need for minor corrections after publication, and thus provide links to arXiv versions of the papers where available. If a correction must be made, it should be made as an update to the arXiv version of the paper by the authors. The CVF maintainers should then be notified of the update via email ( [email protected] ). The conference open access website will be updated periodically to indicate changes made to an arXiv version since the original conference publication date. The original camera-ready version of the paper will be maintained within the open access archive, and will not be removed or replaced by request.

Other Computer Vision Conferences and Workshops

Subscribe to the PwC Newsletter

Join the community, computer vision, semantic segmentation.

computer vision research papers ieee

Tumor Segmentation

computer vision research papers ieee

Panoptic Segmentation

computer vision research papers ieee

3D Semantic Segmentation

computer vision research papers ieee

Weakly-Supervised Semantic Segmentation

Classification.

computer vision research papers ieee

Text Classification

computer vision research papers ieee

Graph Classification

computer vision research papers ieee

Audio Classification

computer vision research papers ieee

Medical Image Classification

Representation learning.

computer vision research papers ieee

Disentanglement

Graph representation learning, sentence embeddings.

computer vision research papers ieee

Network Embedding

Object detection.

computer vision research papers ieee

3D Object Detection

computer vision research papers ieee

Real-Time Object Detection

computer vision research papers ieee

RGB Salient Object Detection

computer vision research papers ieee

Few-Shot Object Detection

Image classification.

computer vision research papers ieee

Out of Distribution (OOD) Detection

computer vision research papers ieee

Few-Shot Image Classification

computer vision research papers ieee

Fine-Grained Image Classification

computer vision research papers ieee

Semi-Supervised Image Classification

Reinforcement learning (rl), off-policy evaluation, multi-objective reinforcement learning, 3d point cloud reinforcement learning, 2d object detection.

computer vision research papers ieee

Edge Detection

computer vision research papers ieee

Open Vocabulary Object Detection

computer vision research papers ieee

Semi-Supervised Object Detection

Deep hashing, table retrieval, domain adaptation.

computer vision research papers ieee

Unsupervised Domain Adaptation

computer vision research papers ieee

Domain Generalization

computer vision research papers ieee

Source-Free Domain Adaptation

Universal domain adaptation, image generation.

computer vision research papers ieee

Image-to-Image Translation

computer vision research papers ieee

Image Inpainting

computer vision research papers ieee

Text-to-Image Generation

computer vision research papers ieee

Conditional Image Generation

Data augmentation.

computer vision research papers ieee

Image Augmentation

computer vision research papers ieee

Text Augmentation

Autonomous vehicles.

computer vision research papers ieee

Autonomous Driving

computer vision research papers ieee

Self-Driving Cars

computer vision research papers ieee

Simultaneous Localization and Mapping

computer vision research papers ieee

Autonomous Navigation

computer vision research papers ieee

Image Denoising

computer vision research papers ieee

Color Image Denoising

computer vision research papers ieee

Sar Image Despeckling

Grayscale image denoising, meta-learning.

computer vision research papers ieee

Few-Shot Learning

computer vision research papers ieee

Sample Probing

computer vision research papers ieee

Depth Estimation

computer vision research papers ieee

Style Transfer

computer vision research papers ieee

3D Reconstruction

computer vision research papers ieee

Neural Rendering

computer vision research papers ieee

3D Face Reconstruction

Contrastive learning.

computer vision research papers ieee

Super-Resolution

computer vision research papers ieee

Image Super-Resolution

computer vision research papers ieee

Video Super-Resolution

computer vision research papers ieee

Multi-Frame Super-Resolution

computer vision research papers ieee

Reference-based Super-Resolution

Pose estimation.

computer vision research papers ieee

3D Human Pose Estimation

computer vision research papers ieee

Keypoint Detection

computer vision research papers ieee

3D Pose Estimation

computer vision research papers ieee

6D Pose Estimation

computer vision research papers ieee

Text-based Image Editing

Text-guided-image-editing.

computer vision research papers ieee

Zero-Shot Text-to-Image Generation

Concept alignment, 3d architecture, 2d semantic segmentation, image segmentation.

computer vision research papers ieee

Text-To-SQL

Text style transfer.

computer vision research papers ieee

Scene Parsing

Self-supervised learning.

computer vision research papers ieee

Point Cloud Pre-training

Unsupervised video clustering, visual question answering (vqa).

computer vision research papers ieee

Visual Question Answering

computer vision research papers ieee

Machine Reading Comprehension

computer vision research papers ieee

Chart Question Answering

computer vision research papers ieee

Embodied Question Answering

Sentiment analysis.

computer vision research papers ieee

Aspect-Based Sentiment Analysis (ABSA)

computer vision research papers ieee

Multimodal Sentiment Analysis

computer vision research papers ieee

Aspect Sentiment Triplet Extraction

computer vision research papers ieee

Twitter Sentiment Analysis

Anomaly detection.

computer vision research papers ieee

Unsupervised Anomaly Detection

computer vision research papers ieee

One-Class Classification

Supervised anomaly detection, anomaly detection in surveillance videos.

computer vision research papers ieee

Temporal Action Localization

computer vision research papers ieee

Video Understanding

computer vision research papers ieee

Video Object Segmentation

computer vision research papers ieee

Action Classification

Video generation, activity recognition.

computer vision research papers ieee

Action Recognition

computer vision research papers ieee

Human Activity Recognition

Egocentric activity recognition.

computer vision research papers ieee

Group Activity Recognition

computer vision research papers ieee

One-Shot Learning

computer vision research papers ieee

Few-Shot Semantic Segmentation

Cross-domain few-shot.

computer vision research papers ieee

Unsupervised Few-Shot Learning

3d object super-resolution, medical image segmentation.

computer vision research papers ieee

Lesion Segmentation

computer vision research papers ieee

Brain Tumor Segmentation

computer vision research papers ieee

Cell Segmentation

computer vision research papers ieee

Brain Segmentation

Monocular depth estimation.

computer vision research papers ieee

Stereo Depth Estimation

Depth and camera motion.

computer vision research papers ieee

3D Depth Estimation

Exposure fairness, optical character recognition (ocr).

computer vision research papers ieee

Active Learning

computer vision research papers ieee

Handwriting Recognition

Handwritten digit recognition, irregular text recognition, facial recognition and modelling.

computer vision research papers ieee

Face Recognition

computer vision research papers ieee

Face Swapping

computer vision research papers ieee

Face Detection

computer vision research papers ieee

Face Verification

computer vision research papers ieee

Facial Expression Recognition (FER)

Instance segmentation.

computer vision research papers ieee

Referring Expression Segmentation

computer vision research papers ieee

3D Instance Segmentation

computer vision research papers ieee

Real-time Instance Segmentation

computer vision research papers ieee

Unsupervised Object Segmentation

Object tracking.

computer vision research papers ieee

Multi-Object Tracking

computer vision research papers ieee

Visual Object Tracking

computer vision research papers ieee

Multiple Object Tracking

computer vision research papers ieee

Cell Tracking

Zero-shot learning.

computer vision research papers ieee

Generalized Zero-Shot Learning

computer vision research papers ieee

Compositional Zero-Shot Learning

Multi-label zero-shot learning.

computer vision research papers ieee

Action Recognition In Videos

Self-supervised action recognition.

computer vision research papers ieee

3D Action Recognition

Few shot action recognition, quantization, data free quantization, unet quantization, continual learning.

computer vision research papers ieee

Class Incremental Learning

Continual named entity recognition, unsupervised class-incremental learning.

computer vision research papers ieee

Scene Understanding

computer vision research papers ieee

Scene Text Recognition

computer vision research papers ieee

Scene Graph Generation

computer vision research papers ieee

Scene Recognition

Adversarial attack.

computer vision research papers ieee

Backdoor Attack

computer vision research papers ieee

Adversarial Text

Adversarial attack detection, real-world adversarial attack, active object detection, image retrieval.

computer vision research papers ieee

Sketch-Based Image Retrieval

computer vision research papers ieee

Content-Based Image Retrieval

computer vision research papers ieee

Composed Image Retrieval (CoIR)

computer vision research papers ieee

Medical Image Retrieval

Dimensionality reduction.

computer vision research papers ieee

Supervised dimensionality reduction

Online nonnegative cp decomposition.

computer vision research papers ieee

Image Stylization

Font style transfer, style generalization, face transfer, optical flow estimation.

computer vision research papers ieee

Video Stabilization

computer vision research papers ieee

Monocular 3D Object Detection

computer vision research papers ieee

3D Object Detection From Stereo Images

computer vision research papers ieee

Multiview Detection

Robust 3d object detection, emotion recognition.

computer vision research papers ieee

Speech Emotion Recognition

computer vision research papers ieee

Emotion Recognition in Conversation

computer vision research papers ieee

Multimodal Emotion Recognition

Emotion-cause pair extraction, image reconstruction.

computer vision research papers ieee

MRI Reconstruction

Action localization.

computer vision research papers ieee

Action Segmentation

Spatio-temporal action localization, person re-identification.

computer vision research papers ieee

Unsupervised Person Re-Identification

Video-based person re-identification, generalizable person re-identification, cloth-changing person re-identification, image captioning.

computer vision research papers ieee

3D dense captioning

Controllable image captioning, aesthetic image captioning.

computer vision research papers ieee

Relational Captioning

Action detection.

computer vision research papers ieee

Skeleton Based Action Recognition

computer vision research papers ieee

Online Action Detection

Audio-visual active speaker detection, visual relationship detection, lighting estimation.

computer vision research papers ieee

3D Room Layouts From A Single RGB Panorama

Road scene understanding, metric learning.

computer vision research papers ieee

Image Restoration

computer vision research papers ieee

Demosaicking

Spectral reconstruction, underwater image restoration.

computer vision research papers ieee

JPEG Artifact Correction

Object recognition.

computer vision research papers ieee

3D Object Recognition

Continuous object recognition.

computer vision research papers ieee

Depiction Invariant Object Recognition

computer vision research papers ieee

Monocular 3D Human Pose Estimation

Pose prediction.

computer vision research papers ieee

3D Multi-Person Pose Estimation

3d human pose and shape estimation, image enhancement.

computer vision research papers ieee

Low-Light Image Enhancement

Image relighting, de-aliasing, continuous control.

computer vision research papers ieee

Steering Control

Drone controller, 3d face modelling.

computer vision research papers ieee

Semi-Supervised Video Object Segmentation

computer vision research papers ieee

Unsupervised Video Object Segmentation

computer vision research papers ieee

Referring Video Object Segmentation

computer vision research papers ieee

Video Salient Object Detection

Multi-label classification.

computer vision research papers ieee

Extreme Multi-Label Classification

Medical code prediction, hierarchical multi-label classification, trajectory prediction.

computer vision research papers ieee

Trajectory Forecasting

Human motion prediction.

computer vision research papers ieee

Multivariate Time Series Imputation

Object localization.

computer vision research papers ieee

Weakly-Supervised Object Localization

Image-based localization, unsupervised object localization, monocular 3d object localization, out-of-distribution detection, image quality assessment, no-reference image quality assessment, blind image quality assessment.

computer vision research papers ieee

Aesthetics Quality Assessment

Stereoscopic image quality assessment.

computer vision research papers ieee

Blind Image Deblurring

Single-image blind deblurring, video semantic segmentation.

computer vision research papers ieee

Camera shot segmentation

Cloud removal.

computer vision research papers ieee

Facial Inpainting

computer vision research papers ieee

Fine-Grained Image Inpainting

Novel view synthesis.

computer vision research papers ieee

Gournd video synthesis from satellite image

Saliency detection.

computer vision research papers ieee

Saliency Prediction

computer vision research papers ieee

Co-Salient Object Detection

Video saliency detection, change detection.

computer vision research papers ieee

Semi-supervised Change Detection

Image compression.

computer vision research papers ieee

Feature Compression

Jpeg compression artifact reduction.

computer vision research papers ieee

Lossy-Compression Artifact Reduction

Color image compression artifact reduction, explainable artificial intelligence, explainable models, explanation fidelity evaluation, fad curve analysis, salient object detection, saliency ranking, ensemble learning, visual reasoning.

computer vision research papers ieee

Visual Commonsense Reasoning

Instruction following, visual instruction following, image registration.

computer vision research papers ieee

2D Classification

computer vision research papers ieee

Neural Network Compression

computer vision research papers ieee

Music Source Separation

Cell detection.

computer vision research papers ieee

Plant Phenotyping

Open-set classification, visual tracking.

computer vision research papers ieee

Point Tracking

Real-time visual tracking, rgb-t tracking.

computer vision research papers ieee

RF-based Visual Tracking

Image manipulation detection.

computer vision research papers ieee

Generalized Zero Shot skeletal action recognition

Zero shot skeletal action recognition, motion estimation, activity prediction, motion prediction, cyber attack detection, sequential skip prediction, 3d point cloud classification.

computer vision research papers ieee

3D Object Classification

computer vision research papers ieee

Few-Shot 3D Point Cloud Classification

Zero-shot transfer 3d point cloud classification, prompt engineering.

computer vision research papers ieee

Visual Prompting

computer vision research papers ieee

Robust 3D Semantic Segmentation

computer vision research papers ieee

Real-Time 3D Semantic Segmentation

computer vision research papers ieee

Unsupervised 3D Semantic Segmentation

Furniture segmentation, gesture recognition.

computer vision research papers ieee

Hand Gesture Recognition

computer vision research papers ieee

Hand-Gesture Recognition

computer vision research papers ieee

RF-based Gesture Recognition

Whole slide images, point cloud registration.

computer vision research papers ieee

Image to Point Cloud Registration

Video captioning.

computer vision research papers ieee

Dense Video Captioning

Boundary captioning, visual text correction, audio-visual video captioning, 3d point cloud interpolation, text detection, medical diagnosis.

computer vision research papers ieee

Alzheimer's Disease Detection

computer vision research papers ieee

Retinal OCT Disease Classification

Blood cell count, thoracic disease classification.

computer vision research papers ieee

Hand Pose Estimation

computer vision research papers ieee

Hand Segmentation

Gesture-to-gesture translation, visual grounding.

computer vision research papers ieee

Person-centric Visual Grounding

computer vision research papers ieee

Phrase Extraction and Grounding (PEG)

Visual odometry.

computer vision research papers ieee

Face Anti-Spoofing

Monocular visual odometry, rain removal.

computer vision research papers ieee

Single Image Deraining

Image clustering.

computer vision research papers ieee

Online Clustering

computer vision research papers ieee

Face Clustering

Multi-view subspace clustering, multi-modal subspace clustering, colorization.

computer vision research papers ieee

Line Art Colorization

computer vision research papers ieee

Point-interactive Image Colorization

computer vision research papers ieee

Color Mismatch Correction

Robot navigation.

computer vision research papers ieee

PointGoal Navigation

Social navigation.

computer vision research papers ieee

Sequential Place Learning

computer vision research papers ieee

Image Dehazing

computer vision research papers ieee

Single Image Dehazing

Video question answering.

computer vision research papers ieee

Zero-Shot Video Question Answer

Few-shot video question answering.

computer vision research papers ieee

Unsupervised Image-To-Image Translation

computer vision research papers ieee

Synthetic-to-Real Translation

computer vision research papers ieee

Multimodal Unsupervised Image-To-Image Translation

computer vision research papers ieee

Cross-View Image-to-Image Translation

computer vision research papers ieee

Fundus to Angiography Generation

Image manipulation, visual localization.

computer vision research papers ieee

Image Editing

Rolling shutter correction, shadow removal, joint deblur and frame interpolation, multimodal fashion image editing, multimodel-guided image editing, stereo matching, conformal prediction.

computer vision research papers ieee

Crowd Counting

computer vision research papers ieee

Visual Crowd Analysis

Group detection in crowds, human-object interaction detection.

computer vision research papers ieee

Affordance Recognition

Visual place recognition.

computer vision research papers ieee

Indoor Localization

3d place recognition, image matching.

computer vision research papers ieee

Semantic correspondence

Patch matching, set matching.

computer vision research papers ieee

Matching Disparate Images

Point cloud classification, jet tagging, few-shot point cloud classification, deepfake detection.

computer vision research papers ieee

Synthetic Speech Detection

Human detection of deepfakes, multimodal forgery detection, image deblurring, low-light image deblurring and enhancement, object reconstruction.

computer vision research papers ieee

3D Object Reconstruction

Document text classification, learning with noisy labels, multi-label classification of biomedical texts, political salient issue orientation detection, hyperspectral.

computer vision research papers ieee

Hyperspectral Image Classification

Hyperspectral unmixing, hyperspectral image segmentation, classification of hyperspectral images.

computer vision research papers ieee

Weakly Supervised Action Localization

Weakly-supervised temporal action localization.

computer vision research papers ieee

Temporal Action Proposal Generation

Activity recognition in videos, 2d human pose estimation, action anticipation.

computer vision research papers ieee

3D Face Animation

Semi-supervised human pose estimation, scene classification.

computer vision research papers ieee

Referring Expression

Point cloud generation, point cloud completion, compressive sensing, video quality assessment, video alignment, temporal sentence grounding, long-video activity recognition, keyword spotting.

computer vision research papers ieee

Small-Footprint Keyword Spotting

Visual keyword spotting, scene text detection.

computer vision research papers ieee

Curved Text Detection

Multi-oriented scene text detection, boundary detection.

computer vision research papers ieee

Junction Detection

Reconstruction, 3d human reconstruction.

computer vision research papers ieee

Single-View 3D Reconstruction

4d reconstruction, single-image-based hdr reconstruction, image matting.

computer vision research papers ieee

Semantic Image Matting

Camera calibration, superpixels, emotion classification.

computer vision research papers ieee

Video Retrieval

Video-text retrieval, video grounding, video-adverb retrieval, replay grounding, composed video retrieval (covr), point cloud segmentation, sensor fusion.

computer vision research papers ieee

Point cloud reconstruction

computer vision research papers ieee

3D Semantic Scene Completion

computer vision research papers ieee

3D Semantic Scene Completion from a single RGB image

Garment reconstruction.

computer vision research papers ieee

Few-Shot Transfer Learning for Saliency Prediction

computer vision research papers ieee

Aerial Video Saliency Prediction

Remote sensing.

computer vision research papers ieee

Remote Sensing Image Classification

Change detection for remote sensing images, building change detection for remote sensing images.

computer vision research papers ieee

Segmentation Of Remote Sensing Imagery

computer vision research papers ieee

The Semantic Segmentation Of Remote Sensing Imagery

Cross-modal retrieval, image-text matching, multilingual cross-modal retrieval.

computer vision research papers ieee

Zero-shot Composed Person Retrieval

Cross-modal retrieval on rsitmd, video summarization.

computer vision research papers ieee

Unsupervised Video Summarization

Supervised video summarization, document layout analysis.

computer vision research papers ieee

Document AI

Document understanding, human detection.

computer vision research papers ieee

Face Generation

computer vision research papers ieee

Talking Head Generation

Talking face generation.

computer vision research papers ieee

Face Age Editing

Facial expression generation, kinship face generation, video instance segmentation.

computer vision research papers ieee

Motion Synthesis

computer vision research papers ieee

Motion Style Transfer

Temporal human motion composition, privacy preserving deep learning, membership inference attack.

computer vision research papers ieee

Generalized Few-Shot Semantic Segmentation

Depth completion.

computer vision research papers ieee

Video Editing

Video temporal consistency, face reconstruction, motion forecasting.

computer vision research papers ieee

Multi-Person Pose forecasting

computer vision research papers ieee

Multiple Object Forecasting

Object discovery, virtual try-on, carla map leaderboard, dead-reckoning prediction, 3d anomaly detection, video anomaly detection, 3d classification, scene flow estimation.

computer vision research papers ieee

Self-supervised Scene Flow Estimation

computer vision research papers ieee

Generalized Referring Expression Segmentation

Gaze estimation.

computer vision research papers ieee

Texture Synthesis

Human parsing.

computer vision research papers ieee

Multi-Human Parsing

Weakly supervised segmentation.

computer vision research papers ieee

3D Multi-Person Pose Estimation (absolute)

computer vision research papers ieee

3D Multi-Person Pose Estimation (root-relative)

computer vision research papers ieee

3D Multi-Person Mesh Recovery

Facial landmark detection.

computer vision research papers ieee

Unsupervised Facial Landmark Detection

computer vision research papers ieee

3D Facial Landmark Localization

Pose tracking.

computer vision research papers ieee

3D Human Pose Tracking

Activity detection, inverse rendering, gait recognition.

computer vision research papers ieee

Multiview Gait Recognition

Gait recognition in the wild, interest point detection, homography estimation, multi-view learning, incomplete multi-view clustering, scene segmentation.

computer vision research papers ieee

Thermal Image Segmentation

Sign language recognition.

computer vision research papers ieee

3D Character Animation From A Single Photo

Interactive segmentation, disease prediction, disease trajectory forecasting.

computer vision research papers ieee

Dichotomous Image Segmentation

Temporal localization.

computer vision research papers ieee

Language-Based Temporal Localization

Temporal defect localization, scene generation, template matching, event-based vision.

computer vision research papers ieee

Event-based Optical Flow

computer vision research papers ieee

Event-Based Video Reconstruction

Event-based motion estimation, multi-label image classification.

computer vision research papers ieee

Multi-label Image Recognition with Partial Labels

computer vision research papers ieee

3D Hand Pose Estimation

Object counting, intelligent surveillance.

computer vision research papers ieee

Vehicle Re-Identification

Relation network, visual dialog.

computer vision research papers ieee

Image Recognition

Fine-grained image recognition, license plate recognition, motion segmentation, camera localization.

computer vision research papers ieee

Camera Relocalization

Disparity estimation.

computer vision research papers ieee

3D Object Tracking

computer vision research papers ieee

3D Single Object Tracking

Lidar semantic segmentation, text to video retrieval, partially relevant video retrieval, text spotting.

computer vision research papers ieee

Person Search

Decision making under uncertainty.

computer vision research papers ieee

Uncertainty Visualization

Knowledge distillation.

computer vision research papers ieee

Data-free Knowledge Distillation

Self-knowledge distillation, mixed reality, few-shot class-incremental learning, class-incremental semantic segmentation, non-exemplar-based class incremental learning, text-to-video generation, text-to-video editing, subject-driven video generation, shadow detection.

computer vision research papers ieee

Shadow Detection And Removal

computer vision research papers ieee

Unconstrained Lip-synchronization

computer vision research papers ieee

Cross-corpus

Micro-expression recognition, micro-expression spotting.

computer vision research papers ieee

3D Facial Expression Recognition

computer vision research papers ieee

Smile Recognition

Moment retrieval.

computer vision research papers ieee

Video Inpainting

computer vision research papers ieee

Future prediction

Overlapped 10-1, overlapped 15-1, overlapped 15-5, disjoint 10-1, disjoint 15-1, image categorization, fine-grained visual categorization, deep attention, video enhancement.

computer vision research papers ieee

Face Image Quality Assessment

Lightweight face recognition.

computer vision research papers ieee

Age-Invariant Face Recognition

Synthetic face recognition, face quality assessement.

computer vision research papers ieee

Stereo Image Super-Resolution

Burst image super-resolution, satellite image super-resolution, multispectral image super-resolution, physics-informed machine learning, soil moisture estimation, line detection, color constancy.

computer vision research papers ieee

Few-Shot Camera-Adaptive Color Constancy

Image cropping, stereo matching hand.

computer vision research papers ieee

Visual Recognition

computer vision research papers ieee

Fine-Grained Visual Recognition

Human mesh recovery, zero-shot action recognition.

computer vision research papers ieee

3D Multi-Object Tracking

Real-time multi-object tracking, multi-animal tracking with identification, grounded multiple object tracking, sign language translation.

computer vision research papers ieee

Tone Mapping

Video reconstruction.

computer vision research papers ieee

Zero Shot Segmentation

Surface normals estimation.

computer vision research papers ieee

Natural Language Transduction

Transparent object detection, transparent objects, video restoration.

computer vision research papers ieee

Analog Video Restoration

3d absolute human pose estimation.

computer vision research papers ieee

Text-to-Face Generation

Image forensics, novel class discovery.

computer vision research papers ieee

HDR Reconstruction

Multi-exposure image fusion, abnormal event detection in video.

computer vision research papers ieee

Semi-supervised Anomaly Detection

Cross-domain few-shot learning, probabilistic deep learning, unsupervised few-shot image classification, generalized few-shot classification, breast cancer histology image classification.

computer vision research papers ieee

Breast Cancer Detection

Breast cancer histology image classification (20% labels), infrared and visible image fusion.

computer vision research papers ieee

Steganalysis

Texture classification, vision-language navigation.

computer vision research papers ieee

Spoof Detection

Face presentation attack detection, detecting image manipulation, cross-domain iris presentation attack detection, finger dorsal image spoof detection, image animation.

computer vision research papers ieee

Iris Recognition

Pupil dilation, pedestrian attribute recognition.

computer vision research papers ieee

Reflection Removal

computer vision research papers ieee

One-shot visual object segmentation

computer vision research papers ieee

Sketch Recognition

computer vision research papers ieee

Face Sketch Synthesis

Drawing pictures.

computer vision research papers ieee

Photo-To-Caricature Translation

computer vision research papers ieee

Unbiased Scene Graph Generation

computer vision research papers ieee

Panoptic Scene Graph Generation

Action understanding, automatic post-editing.

computer vision research papers ieee

Document Image Classification

computer vision research papers ieee

Geometric Matching

Highlight detection, multi-view 3d reconstruction, object categorization, severity prediction, intubation support prediction, meme classification, hateful meme classification, blind face restoration.

computer vision research papers ieee

Cloud Detection

computer vision research papers ieee

Dense Captioning

Face reenactment.

computer vision research papers ieee

Human action generation

computer vision research papers ieee

Action Generation

Image outpainting.

computer vision research papers ieee

Person Retrieval

Surgical phase recognition, online surgical phase recognition, offline surgical phase recognition, human dynamics.

computer vision research papers ieee

3D Human Dynamics

computer vision research papers ieee

Semantic SLAM

computer vision research papers ieee

Object SLAM

Action quality assessment, image stitching.

computer vision research papers ieee

Text based Person Retrieval

Text-to-image, story visualization, complex scene breaking and synthesis, object segmentation.

computer vision research papers ieee

Camouflaged Object Segmentation

Landslide segmentation, text-line extraction, situation recognition, grounded situation recognition, image deconvolution.

computer vision research papers ieee

Intrinsic Image Decomposition

Line segment detection, multi-target domain adaptation, image fusion, pansharpening, image to video generation.

computer vision research papers ieee

Unconditional Video Generation

Table recognition, weakly-supervised instance segmentation, image smoothing.

computer vision research papers ieee

Camouflaged Object Segmentation with a Single Task-generic Prompt

Image morphing, image steganography, point clouds, rotated mnist, diffusion personalization.

computer vision research papers ieee

Diffusion Personalization Tuning Free

Efficient diffusion personalization, image shadow removal, layout design, motion detection, sports analytics, viewpoint estimation.

computer vision research papers ieee

Fake Image Detection

computer vision research papers ieee

GAN image forensics

computer vision research papers ieee

Fake Image Attribution

Drone navigation, drone-view target localization, lane detection.

computer vision research papers ieee

3D Lane Detection

License plate detection.

computer vision research papers ieee

Multi-Object Tracking and Segmentation

computer vision research papers ieee

Occlusion Handling

computer vision research papers ieee

Video Panoptic Segmentation

Person identification, zero-shot transfer image classification.

computer vision research papers ieee

Value prediction

Body mass index (bmi) prediction, contour detection.

computer vision research papers ieee

Face Image Quality

Photo retouching.

computer vision research papers ieee

Grasp Generation

computer vision research papers ieee

3D Canonical Hand Pose Estimation

Shape representation of 3d point clouds, 3d point cloud reconstruction, dense pixel correspondence estimation, human part segmentation.

computer vision research papers ieee

Image to 3D

Symmetry detection, video style transfer, motion retargeting, referring image matting.

computer vision research papers ieee

Referring Image Matting (Expression-based)

computer vision research papers ieee

Referring Image Matting (Keyword-based)

computer vision research papers ieee

Referring Image Matting (RefMatte-RW100)

Referring image matting (prompt-based).

computer vision research papers ieee

hand-object pose

Robot pose estimation, 3d point cloud linear classification, crop yield prediction, image quality estimation.

computer vision research papers ieee

Material Recognition

Road damage detection.

computer vision research papers ieee

Document Shadow Removal

Space-time video super-resolution, traffic sign detection, video matting.

computer vision research papers ieee

Human Interaction Recognition

One-shot 3d action recognition, mutual gaze, affordance detection.

computer vision research papers ieee

Hand Detection

Image similarity search.

computer vision research papers ieee

Multiview Learning

Person recognition.

computer vision research papers ieee

Precipitation Forecasting

Inverse tone mapping, image/document clustering, self-organized clustering, 3d shape modeling.

computer vision research papers ieee

Action Analysis

Facial editing.

computer vision research papers ieee

Holdout Set

Image forgery detection, image instance retrieval, amodal instance segmentation, material classification.

computer vision research papers ieee

Open Vocabulary Attribute Detection

Referring expression generation, instance search.

computer vision research papers ieee

Audio Fingerprint

computer vision research papers ieee

Open-World Semi-Supervised Learning

Semi-supervised image classification (cold start).

computer vision research papers ieee

3D Object Reconstruction From A Single Image

Art analysis, event segmentation, generic event boundary detection, food recognition.

computer vision research papers ieee

Gaze Prediction

Image-variation, point cloud super resolution, semi-supervised instance segmentation, skills assessment.

computer vision research papers ieee

Sensor Modeling

Video segmentation, camera shot boundary detection, open-vocabulary video segmentation, open-world video segmentation, lung nodule classification, lung nodule 3d classification, lung nodule detection, lung nodule 3d detection, video prediction, earth surface forecasting, predict future video frames, 3d scene reconstruction, handwriting generation, image retouching, motion magnification, scene change detection.

computer vision research papers ieee

Sketch-to-Image Translation

Skills evaluation, highlight removal, 3d shape reconstruction from a single 2d image.

computer vision research papers ieee

Shape from Texture

Handwriting verification, bangla spelling error correction, birds eye view object detection.

computer vision research papers ieee

Zero-Shot Composed Image Retrieval (ZS-CIR)

computer vision research papers ieee

JPEG Artifact Removal

Multispectral object detection, pose retrieval, rgb-d reconstruction, scanpath prediction, seeing beyond the visible, deception detection, deception detection in videos, constrained lip-synchronization, face dubbing.

computer vision research papers ieee

Video Visual Relation Detection

Human-object relationship detection, 3d shape reconstruction, 3d shape representation.

computer vision research papers ieee

3D Dense Shape Correspondence

Audio-visual synchronization, image manipulation localization, kinship verification, medical image enhancement, multiple people tracking.

computer vision research papers ieee

Network Interpretation

Semi-supervised domain generalization, single-object discovery, training-free 3d point cloud classification, unsupervised semantic segmentation.

computer vision research papers ieee

Unsupervised Semantic Segmentation with Language-image Pre-training

Binary classification, cancer-no cancer per image classification, llm-generated text detection, cancer-no cancer per breast classification, suspicous (birads 4,5)-no suspicous (birads 1,2,3) per image classification, cancer-no cancer per view classification.

computer vision research papers ieee

Sequential Place Recognition

Autonomous flight (dense forest), multimodal machine translation.

computer vision research papers ieee

Face to Face Translation

Multimodal lexical translation, multiple object tracking with transformer.

computer vision research papers ieee

Multiple Object Track and Segmentation

10-shot image generation, bokeh effect rendering, drivable area detection, face anonymization, font recognition, horizon line estimation, image imputation.

computer vision research papers ieee

Instance Shadow Detection

Long video retrieval (background removed), medical image denoising.

computer vision research papers ieee

Occlusion Estimation

Open vocabulary panoptic segmentation, physiological computing.

computer vision research papers ieee

Lake Ice Monitoring

Short-term object interaction anticipation, spatio-temporal video grounding, unsupervised 3d point cloud linear evaluation, video forensics, wireframe parsing, single-image-generation, unsupervised anomaly detection with specified settings -- 30% anomaly, root cause ranking, anomaly detection at 30% anomaly, anomaly detection at various anomaly percentages.

computer vision research papers ieee

Unsupervised Contextual Anomaly Detection

Facial expression recognition, cross-domain facial expression recognition, zero-shot facial expression recognition, landmark tracking, muscle tendon junction identification, 2d semantic segmentation task 3 (25 classes), document enhancement, 3d scene editing, action assessment, ad-hoc video search, defocus blur detection, event data classification, generalized referring expression comprehension, image deblocking, motion disentanglement, personality trait recognition, synthetic image detection, traffic accident detection, accident anticipation, unsupervised landmark detection, visual speech recognition, lip to speech synthesis, gaze redirection, 2d pose estimation, category-agnostic pose estimation, overlapping pose estimation, weakly supervised action segmentation (transcript), weakly supervised action segmentation (action set)), calving front delineation in synthetic aperture radar imagery, calving front delineation in synthetic aperture radar imagery with fixed training amount.

computer vision research papers ieee

Handwritten Line Segmentation

Handwritten word segmentation, handwritten text recognition, handwritten document recognition, unsupervised text recognition.

computer vision research papers ieee

General Action Video Anomaly Detection

Physical video anomaly detection, monocular cross-view road scene parsing(road), monocular cross-view road scene parsing(vehicle).

computer vision research papers ieee

Transparent Object Depth Estimation

3d open-vocabulary instance segmentation.

computer vision research papers ieee

4D Panoptic Segmentation

Animated gif generation, historical color image dating, stochastic human motion prediction, image retargeting, image and video forgery detection, infrared image super-resolution, motion captioning, personalized segmentation, persuasion strategies, scene-aware dialogue, spatial relation recognition, spatial token mixer, steganographics, story continuation.

computer vision research papers ieee

Unsupervised Anomaly Detection with Specified Settings -- 0.1% anomaly

Unsupervised anomaly detection with specified settings -- 1% anomaly, unsupervised anomaly detection with specified settings -- 10% anomaly, unsupervised anomaly detection with specified settings -- 20% anomaly, vehicle speed estimation, visual social relationship recognition, zero-shot text-to-video generation, continual anomaly detection, continual semantic segmentation, overlapped 5-3, overlapped 25-25, evolving domain generalization, source-free domain generalization, micro-expression generation, micro-expression generation (megc2021), unsupervised panoptic segmentation, unsupervised zero-shot panoptic segmentation, 3d rotation estimation, 3d semantic occupancy prediction, camera auto-calibration, data ablation, defocus estimation, derendering.

computer vision research papers ieee

Occluded Face Detection

Fingertip detection, gait identification, human-object interaction concept discovery, image comprehension, speaker-specific lip to speech synthesis, multi-person pose estimation, neural stylization.

computer vision research papers ieee

Part-aware Panoptic Segmentation

computer vision research papers ieee

Population Mapping

Pornography detection, raw reconstruction, semi-supervised video classification, spectrum cartography, synthetic image attribution, training-free 3d part segmentation, unsupervised image decomposition, video propagation, visual analogies, explanatory visual question answering, weakly supervised 3d point cloud segmentation, weakly-supervised panoptic segmentation, drone-based object tracking, text-guided-generation, brain visual reconstruction, brain visual reconstruction from fmri, fashion understanding, semi-supervised fashion compatibility.

computer vision research papers ieee

intensity image denoising

Lifetime image denoising, observation completion, active observation completion, boundary grounding.

computer vision research papers ieee

Video Narrative Grounding

3d inpainting, 4d spatio temporal semantic segmentation.

computer vision research papers ieee

Age Estimation

computer vision research papers ieee

Few-shot Age Estimation

Age and gender estimation, brdf estimation, camouflage segmentation, clothing attribute recognition, depth image estimation, detecting shadows, dynamic texture recognition.

computer vision research papers ieee

Disguised Face Verification

Few shot open set object detection, generalized zero-shot learning - unseen, hd semantic map learning, human-object interaction anticipation, image deep networks, keypoint detection and image matching, manufacturing quality control, materials imaging, multi-person pose estimation and tracking.

computer vision research papers ieee

Multi-modal image segmentation

Multi-object discovery, neural radiance caching.

computer vision research papers ieee

Parking Space Occupancy

computer vision research papers ieee

Partial Video Copy Detection

computer vision research papers ieee

Multimodal Patch Matching

Perpetual view generation, prediction of occupancy grid maps, procedure learning, prompt-driven zero-shot domain adaptation, repetitive action counting, svbrdf estimation, single-shot hdr reconstruction, on-the-fly sketch based image retrieval, thermal image denoising, trademark retrieval, unsupervised instance segmentation, unsupervised zero-shot instance segmentation, vehicle key-point and orientation estimation.

computer vision research papers ieee

Video-Adverb Retrieval (Unseen Compositions)

Video-to-image affordance grounding.

computer vision research papers ieee

Visual Sentiment Prediction

Human-scene contact detection, localization in video forgery, 3d canonicalization.

computer vision research papers ieee

Cube Engraving Classification

3d scene graph alignment, 3d surface generation.

computer vision research papers ieee

Visibility Estimation from Point Cloud

Amodal layout estimation, blink estimation, camera absolute pose regression, change data generation.

computer vision research papers ieee

Image-Guided Composition

Constrained diffeomorphic image registration, continuous affect estimation, deep feature inversion, document image skew estimation, earthquake prediction, fashion compatibility learning.

computer vision research papers ieee

Displaced People Recognition

Finger vein recognition, flooded building segmentation.

computer vision research papers ieee

Future Hand Prediction

Gaze target estimation, house generation, human fmri response prediction, hurricane forecasting, ifc entity classification, image declipping, image similarity detection.

computer vision research papers ieee

One-Shot Face Stylization

Image text removal, image-to-gps verification.

computer vision research papers ieee

Image-based Automatic Meter Reading

Dial meter reading, indoor scene reconstruction, jpeg decompression.

computer vision research papers ieee

Kiss Detection

Laminar-turbulent flow localisation.

computer vision research papers ieee

Landmark Recognition

Brain landmark detection, corpus video moment retrieval, mllm evaluation: aesthetics, medical image deblurring, mental workload estimation, meter reading, micro-gesture recognition, mistake detection, motion expressions guided video segmentation, natural image orientation angle detection, multi-object colocalization, multilingual text-to-image generation, video emotion detection, nwp post-processing, occluded 3d object symmetry detection, open set video captioning, open vocabulary semantic segmentation, zero-guidance segmentation, pso-convnets dynamics 1, pso-convnets dynamics 2, partial point cloud matching.

computer vision research papers ieee

Partially View-aligned Multi-view Learning

computer vision research papers ieee

Pedestrian Detection

computer vision research papers ieee

Thermal Infrared Pedestrian Detection

Personality trait recognition by face, physical attribute prediction, point cloud semantic completion, point cloud classification dataset, point- of-no-return (pnr) temporal localization, pose contrastive learning, potrait generation, prostate zones segmentation, pulmorary vessel segmentation, pulmonary artery–vein classification, reference expression generation, safety perception recognition, interspecies facial keypoint transfer, specular reflection mitigation, specular segmentation, state change object detection, surface normals estimation from point clouds, transform a video into a comics, transparency separation, typeface completion.

computer vision research papers ieee

Unbalanced Segmentation

computer vision research papers ieee

Unsupervised Long Term Person Re-Identification

Video correspondence flow, video frame interpolation.

computer vision research papers ieee

eXtreme-Video-Frame-Interpolation

Video individual counting.

computer vision research papers ieee

Key-Frame-based Video Super-Resolution (K = 15)

Yield mapping in apple orchards, lidar absolute pose regression, opd: single-view 3d openable part detection, self-supervised scene text recognition, video narration captioning, spectral estimation, spectral estimation from a single rgb image, 3d prostate segmentation, aggregate xview3 metric, atomic action recognition, composite action recognition, calving front delineation from synthetic aperture radar imagery, computer vision transduction, crosslingual text-to-image generation, damaged building detection, zero-shot dense video captioning, document to image conversion, frame duplication detection, geometrical view, hyperview challenge.

computer vision research papers ieee

Image Operation Chain Detection

Kinematic based workflow recognition, logo recognition.

computer vision research papers ieee

MLLM Aesthetic Evaluation

Motion detection in non-stationary scenes, open-set video tagging, segmentation based workflow recognition, small object detection.

computer vision research papers ieee

Rice Grain Disease Detection

Sperm morphology classification, video & kinematic base workflow recognition, video based workflow recognition, video, kinematic & segmentation base workflow recognition, animal pose estimation.

computer vision research papers ieee

International Journal of Computer Vision

International Journal of Computer Vision (IJCV) details the science and engineering of this rapidly growing field. Regular articles present major technical advances of broad general interest. Survey articles offer critical reviews of the state of the art and/or tutorial presentations of pertinent topics.

Coverage includes:

- Mathematical, physical and computational aspects of computer vision: image formation, processing, analysis, and interpretation; machine learning techniques; statistical approaches; sensors.

- Applications: image-based rendering, computer graphics, robotics, photo interpretation, image retrieval, video analysis and annotation, multi-media, and more.

- Connections with human perception: computational and architectural aspects of human vision.

The journal also features book reviews, position papers, editorials by leading scientific figures, as well as additional on-line material, such as still images, video sequences, data sets, and software. Please note: the median time indicated below is computed over all the submitted manuscripts including the ones that are not put into the review pipeline at the onset of the review process. The typical time to first decision for manuscripts is approximately 96 days.  

  • Yasuyuki Matsushita,
  • Jiri Matas,
  • Svetlana Lazebnik

computer vision research papers ieee

Latest issue

Volume 132, Issue 3

Latest articles

Robust heterogeneous model fitting for multi-source image correspondences.

  • Shuyuan Lin
  • Feiran Huang

computer vision research papers ieee

FunnyNet-W: Multimodal Learning of Funny Moments in Videos in the Wild

  • Zhi-Song Liu
  • Robin Courant
  • Vicky Kalogeiton

computer vision research papers ieee

Learning to Generalize over Subpartitions for Heterogeneity-Aware Domain Adaptive Nuclei Segmentation

  • Dongnan Liu
  • Weidong Cai

computer vision research papers ieee

UniMod1K: Towards a More Universal Large-Scale Dataset and Benchmark for Multi-modal Learning

  • Xue-Feng Zhu
  • Tianyang Xu
  • Josef Kittler

computer vision research papers ieee

Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale Feature Fusion

  • Gongjie Zhang
  • Zhipeng Luo
  • Eric P. Xing

computer vision research papers ieee

Journal updates

Special issue guidelines.

Guidelines for IJCV special issue papers and proposals

Call for Papers: Special Issue on Biometrics Security and Privacy

Guest editors:  Jun Wan, Sergio Escalera, Arun Ross, Philip Torr Submission deadline: extended to 15 September 2023

Call for Papers: Special Issue on Open-World Visual Recognition

Guest editors:  Zhun Zhong, Hong Liu, Yin Cui, Shin'ichi Satoh, Nicu Sebe, Ming-Hsuan Yang Submission deadline:  extended to 15 December 2023

Call for Papers: Special Issue on Computer Vision Approaches for Animal Tracking and Modeling 2023

Guest editors:  Anna Zamansky, Helge Rhodin, Silvia Zuffi, Hyun Soo Park, Sara Beery, Angjoo Kanazawa, Shohei Nobuhara Submission deadline:  31 August 2023

Journal information

  • ACM Digital Library
  • Current Contents/Engineering, Computing and Technology
  • EI Compendex
  • Google Scholar
  • Japanese Science and Technology Agency (JST)
  • Norwegian Register for Scientific Journals and Series
  • OCLC WorldCat Discovery Service
  • Science Citation Index Expanded (SCIE)
  • TD Net Discovery Service
  • UGC-CARE List (India)

Rights and permissions

Springer policies

© Springer Science+Business Media, LLC, part of Springer Nature

  • Find a journal
  • Publish with us
  • Track your research

Survey of Internet of Things Applications using Raspberry Pi and Computer Vision

Ieee account.

  • Change Username/Password
  • Update Address

Purchase Details

  • Payment Options
  • Order History
  • View Purchased Documents

Profile Information

  • Communications Preferences
  • Profession and Education
  • Technical Interests
  • US & Canada: +1 800 678 4333
  • Worldwide: +1 732 981 0060
  • Contact & Support
  • About IEEE Xplore
  • Accessibility
  • Terms of Use
  • Nondiscrimination Policy
  • Privacy & Opting Out of Cookies

A not-for-profit organization, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity. © Copyright 2024 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.

IEEE Transactions on Games

Hero Image for  Special Issue on Computer Vision and Games

Special Issue on Computer Vision and Games

Video Games and Computer Vision research have long held a symbiotic relationship. On the one hand, virtual worlds in games are often used for collecting training data or as testbeds for computer vision models since they provide a greater deal of flexibility, control and scalability in the data collection process compared to the real world. On the other hand, computer vision advancements have enabled us to push the frontiers of what is possible within these artificial game worlds and have transformed the processes with which these worlds are created. However, significant research questions still remain unaddressed both in the field (Computer Vision) and the domain (Games), which include technical and engineering challenges.

This special issue invites research papers aiming to bridge the existing gaps between computer vision research and games engineering, with the motive of bringing together the games research community and the computer vision community that have largely operated independently until now. We are inviting papers for two main tracks. The first track focuses on introducing novel techniques within computer vision research that can advance the field of digital games. The second track, instead, focuses on leveraging game technologies to advance state-of-the-art techniques in computer vision. The list of topics below is not inclusive of all research directions that will be represented.

  • 1) Computer Vision for Games

computer_vision_for_games_b

  • CV for game-playing, game testing and player modelling.
  • Data-driven CV to improve game graphics, animations, level-design, etc. as well as procedural content generation.
  • HCI through visual interfaces (gestures, posture, gaze, etc.).
  • Extended reality games.
  • Synthetic data and media generation based on users' emotions, behaviour, etc.
  • Improving real-time applicability of vision models integrated within games and game engines.
  • 2) Games for Computer Vision

games_for_computer_vision_b

  • Game worlds that aid data augmentation techniques.
  • Rich game-based labelled datasets for tasks such as object detection, segmentation, or depth and flow estimation.
  • Ethics of game-based data collection and inference.
  • Forward modelling in and for games.
  • Generalisation and robustness in vision models leveraging a plethora of existing commercial games.
  • Unsupervised pre-training of image/video representations and world transition models from gameplay data.

We invite the submission of high quality papers on the topics above in the full paper format. Authors should follow normal IEEE Transactions on Games guidelines for their submissions, but clearly identify their papers for this special issue during the submission process. Extended versions of previously published conference or workshop papers are welcome, provided that the journal paper is a significant extension, and is accompanied by a cover letter explaining the additional contribution. You may visit the submission guidelines for author information guidelines and page length limits.

  • Important Dates:
  • Paper submission: January 31, 2024
  • First decisions: May 31st, 2024
  • Early access SI publication (online): August 2024
  • Publication in print: End 2024
  • Guest Editors:
  • Chintan Trivedi (University of Malta)
  • Matthew Guzdial (University of Alberta)
  • Konstantinos Makantasis (University of Malta)
  • Julian Togelius (New York University)
  • Nicu Sebe (University of Trento)

computer vision Recently Published Documents

Total documents.

  • Latest Documents
  • Most Cited Documents
  • Contributed Authors
  • Related Sources
  • Related Keywords

2D Computer Vision

A survey on generative adversarial networks: variants, applications, and training.

The Generative Models have gained considerable attention in unsupervised learning via a new and practical framework called Generative Adversarial Networks (GAN) due to their outstanding data generation capability. Many GAN models have been proposed, and several practical applications have emerged in various domains of computer vision and machine learning. Despite GANs excellent success, there are still obstacles to stable training. The problems are Nash equilibrium, internal covariate shift, mode collapse, vanishing gradient, and lack of proper evaluation metrics. Therefore, stable training is a crucial issue in different applications for the success of GANs. Herein, we survey several training solutions proposed by different researchers to stabilize GAN training. We discuss (I) the original GAN model and its modified versions, (II) a detailed analysis of various GAN applications in different domains, and (III) a detailed study about the various GAN training obstacles as well as training solutions. Finally, we reveal several issues as well as research outlines to the topic.

Efficient Channel Attention Based Encoder–Decoder Approach for Image Captioning in Hindi

Image captioning refers to the process of generating a textual description that describes objects and activities present in a given image. It connects two fields of artificial intelligence, computer vision, and natural language processing. Computer vision and natural language processing deal with image understanding and language modeling, respectively. In the existing literature, most of the works have been carried out for image captioning in the English language. This article presents a novel method for image captioning in the Hindi language using encoder–decoder based deep learning architecture with efficient channel attention. The key contribution of this work is the deployment of an efficient channel attention mechanism with bahdanau attention and a gated recurrent unit for developing an image captioning model in the Hindi language. Color images usually consist of three channels, namely red, green, and blue. The channel attention mechanism focuses on an image’s important channel while performing the convolution, which is basically to assign higher importance to specific channels over others. The channel attention mechanism has been shown to have great potential for improving the efficiency of deep convolution neural networks (CNNs). The proposed encoder–decoder architecture utilizes the recently introduced ECA-NET CNN to integrate the channel attention mechanism. Hindi is the fourth most spoken language globally, widely spoken in India and South Asia; it is India’s official language. By translating the well-known MSCOCO dataset from English to Hindi, a dataset for image captioning in Hindi is manually created. The efficiency of the proposed method is compared with other baselines in terms of Bilingual Evaluation Understudy (BLEU) scores, and the results obtained illustrate that the method proposed outperforms other baselines. The proposed method has attained improvements of 0.59%, 2.51%, 4.38%, and 3.30% in terms of BLEU-1, BLEU-2, BLEU-3, and BLEU-4 scores, respectively, with respect to the state-of-the-art. Qualities of the generated captions are further assessed manually in terms of adequacy and fluency to illustrate the proposed method’s efficacy.

Feature Matching-based Approaches to Improve the Robustness of Android Visual GUI Testing

In automated Visual GUI Testing (VGT) for Android devices, the available tools often suffer from low robustness to mobile fragmentation, leading to incorrect results when running the same tests on different devices. To soften these issues, we evaluate two feature matching-based approaches for widget detection in VGT scripts, which use, respectively, the complete full-screen snapshot of the application ( Fullscreen ) and the cropped images of its widgets ( Cropped ) as visual locators to match on emulated devices. Our analysis includes validating the portability of different feature-based visual locators over various apps and devices and evaluating their robustness in terms of cross-device portability and correctly executed interactions. We assessed our results through a comparison with two state-of-the-art tools, EyeAutomate and Sikuli. Despite a limited increase in the computational burden, our Fullscreen approach outperformed state-of-the-art tools in terms of correctly identified locators across a wide range of devices and led to a 30% increase in passing tests. Our work shows that VGT tools’ dependability can be improved by bridging the testing and computer vision communities. This connection enables the design of algorithms targeted to domain-specific needs and thus inherently more usable and robust.

Computer vision to recognize construction waste compositions: A novel boundary-aware transformer (BAT) model

Computer vision for autonomous uav flight safety: an overview and a vision-based safe landing pipeline example.

Recent years have seen an unprecedented spread of Unmanned Aerial Vehicles (UAVs, or “drones”), which are highly useful for both civilian and military applications. Flight safety is a crucial issue in UAV navigation, having to ensure accurate compliance with recently legislated rules and regulations. The emerging use of autonomous drones and UAV swarms raises additional issues, making it necessary to transfuse safety- and regulations-awareness to relevant algorithms and architectures. Computer vision plays a pivotal role in such autonomous functionalities. Although the main aspects of autonomous UAV technologies (e.g., path planning, navigation control, landing control, mapping and localization, target detection/tracking) are already mature and well-covered, ensuring safe flying in the vicinity of crowds, avoidance of passing over persons, or guaranteed emergency landing capabilities in case of malfunctions, are generally treated as an afterthought when designing autonomous UAV platforms for unstructured environments. This fact is reflected in the fragmentary coverage of the above issues in current literature. This overview attempts to remedy this situation, from the point of view of computer vision. It examines the field from multiple aspects, including regulations across the world and relevant current technologies. Finally, since very few attempts have been made so far towards a complete UAV safety flight and landing pipeline, an example computer vision-based UAV flight safety pipeline is introduced, taking into account all issues present in current autonomous drones. The content is relevant to any kind of autonomous drone flight (e.g., for movie/TV production, news-gathering, search and rescue, surveillance, inspection, mapping, wildlife monitoring, crowd monitoring/management), making this a topic of broad interest.

Automatic recognition and classification of microseismic waveforms based on computer vision

Promises and pitfalls of using computer vision to make inferences about landscape preferences: evidence from an urban-proximate park system, weight-sharing neural architecture search: a battle to shrink the optimization gap.

Neural architecture search (NAS) has attracted increasing attention. In recent years, individual search methods have been replaced by weight-sharing search methods for higher search efficiency, but the latter methods often suffer lower instability. This article provides a literature review on these methods and owes this issue to the optimization gap . From this perspective, we summarize existing approaches into several categories according to their efforts in bridging the gap, and we analyze both advantages and disadvantages of these methodologies. Finally, we share our opinions on the future directions of NAS and AutoML. Due to the expertise of the authors, this article mainly focuses on the application of NAS to computer vision problems.

Assessing surface drainage conditions at the street and neighborhood scale: A computer vision and flow direction method applied to lidar data

Export citation format, share document.

IMAGES

  1. IEEE Paper for Image Processing

    computer vision research papers ieee

  2. (PDF) Attention is All They Need: Exploring the Media Archaeology of

    computer vision research papers ieee

  3. Advanced submicron research and technology development at the national

    computer vision research papers ieee

  4. Ieee Paper Review Format / Paper ieee / The institute of electrical and

    computer vision research papers ieee

  5. Empirical Evaluation of Computer Vision Algorithms

    computer vision research papers ieee

  6. IEEE format sample

    computer vision research papers ieee

VIDEO

  1. IEEE video

  2. EPSILON 2023: DAY 1

COMMENTS

  1. The application of deep learning in computer vision

    This paper first reviews the main ideas of deep learning, and displays several related frequently-used algorithms for computer vision. Afterwards, the current research status of computer vision field is demonstrated in this paper, particularly the main applications of deep learning in the research field.

  2. Computer Vision Technology Based on Deep Learning

    Based on the current commonly used method of computer vision technology-deep learning, this paper outlines the development of deep learning models, and determines the inflection point of the development of the introduction of convolutional neural networks.

  3. Deep learning in computer vision: A critical review of emerging

    Computer vision Literature review Code metadata Permanent link to reproducible Capsule: https://doi.org/10.24433/CO.0411648.v1. 1. Introduction Deep learning (DL), a prevailing branch of artificial intelligence (AI), has been extended with diversified network structures.

  4. Computer Vision Based Object Detection and Recognition ...

    Computer Vision Based Object Detection and Recognition System for Image Searching Abstract: Computer Vision is a concept which works with the methods for automatic extraction, analysis and understanding of useful information from a single image or a sequence of images.

  5. Computer vision: basic principles

    Abstract: The author provides a general introduction to computer vision. He discusses basic techniques and computer implementations, and also indicates areas in which further research is needed. He focuses on two-dimensional object recognition, i.e. recognition of an object whose spatial orientation, relative to the viewing direction is known.< >

  6. Analysis of Computer Vision for Graphics and Animation

    Abstract: The field of computer vision is rapidly evolving. Pictures and videos can be obtained and processed to model, duplicate, and occasionally introduce additional visuals to complete valuable tasks. This Paper outlines a method for gathering, refining, and comprehending video and images.

  7. CVPR 2021 Open Access Repository

    These CVPR 2021 papers are the Open Access versions, provided by the Computer Vision Foundation. Except for the watermark, they are identical to the accepted versions; the final published version of the proceedings is available on IEEE Xplore. This material is presented to ensure timely dissemination of scholarly and technical work.

  8. Computer Vision Imaging Based on Artificial Intelligence

    Abstract: In order to study artificial intelligence-based computer vision imaging, computer technology was used to efficiently and accurately obtain relevant information from environmental images or videos. Things and phenomena in the objective world were analyzed, judged, and decided.

  9. Call for Papers: IEEE/CVF CVPR

    Reviews Released: 24 January 2023 Rebuttal Period: 24-31 January 2023 Final Decisions: 27 February 2023 Papers in the main technical program must describe high-quality, original research. Topics of interest include all aspects of computer vision and pattern recognition including, but not limited to: 3D from multi-view and sensors

  10. PDF IEEE Transactions on Evolutionary Computation Special Issue on

    Evolutionary computer vision (ECV) is at the intersection of two major research fields of artificial intelligence: computer vision and evolutionary computation. This special issue aims to provide an overview of state-of-the-art contributions to the latest research and development in the discipline.

  11. Resources for Computer Vision

    First coming into discovery with Seymour Papert's Summer Vision Project of 1966, computer vision has been in development for decades, improving all along the way and creating new possibilities for everyone. Though complex, the process of these systems can be broken down into four fundamental steps:

  12. Research on Image Processing Technology of Computer Vision Algorithm

    Starting from computer vision algorithms and image processing technologies, the computer vision display system is designed, and image distortion correction algorithms are explored for reference. Published in: 2020 International Conference on Computer Vision, Image and Deep Learning (CVIDL) Article #: Date of Conference: 10-12 July 2020

  13. WACV 2023 Open Access Repository

    These WACV 2023 papers are the Open Access versions, provided by the Computer Vision Foundation. ... {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)}, month = {January}, year = {2023}, pages = {4799-4808} } Self Supervised Low Dose Computed Tomography Image Denoising Using Invertible Network Exploiting ...

  14. Computer Vision and Pattern Recognition

    José-M. Acosta-Triana, David Gimeno-Gómez, Carlos-D. Martínez-Hinarejos. Comments: Accepted at the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING) Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)

  15. Rethinking the Inception Architecture for Computer Vision

    Convolutional networks are at the core of most state-of-the-art computer vision solutions for a wide variety of tasks. Since 2014 very deep convolutional networks started to become mainstream, yielding substantial gains in various benchmarks. Although increased model size and computational cost tend to translate to immediate quality gains for most tasks (as long as enough labeled data is ...

  16. CVF Open Access

    These research papers are the Open Access versions, provided by the Computer Vision Foundation. Except for the watermark, they are identical to the accepted versions; the final published version of the proceedings is available on IEEE Xplore.

  17. Computer Vision

    Stay informed on the latest trending ML papers with code, research developments, libraries, methods, and datasets. ... Or, discuss a change on Slack. Browse SoTA > Computer Vision Computer Vision. 4450 benchmarks • 1365 tasks • 2844 datasets • 42384 papers with code Semantic Segmentation Semantic Segmentation. 287 benchmarks ...

  18. IET Computer Vision

    IET Computer Vision. IET Computer Vision is a fully open access journal that introduces new horizons and sets the agenda for future avenues of research in a wide range of areas of computer vision. We are a fully open access journal that welcomes research articles reporting novel methodologies and significant results of interest.

  19. Home

    International Journal of Computer Vision (IJCV) details the science and engineering of this rapidly growing field. Regular articles present major technical advances of broad general interest. Survey articles offer critical reviews of the state of the art and/or tutorial presentations of pertinent topics. Coverage includes:

  20. Machine Learning in Computer Vision

    The machine learning and computer vision research is still evolving [1]. Computer vision is an essential part of Internet of Things, Industrial Internet of Things, and brain human interfaces. The complex human activities are recognized and monitored in multimedia streams using machine learning and computer vison.

  21. Survey of Internet of Things Applications using Raspberry ...

    This paper gives a survey of Internet of Things (IoT) solutions using Raspberry Pi (RPi) Single Board Computer (SBC) and methods of Artificial Intelligence (AI) area - Computer Vision (CV). Solutions for several areas of IoT applications are presented and compared. An overview of the used hardware, software, CV methods, and CV algorithms are given.

  22. Special Issue on Computer Vision and Games

    This special issue invites research papers aiming to bridge the existing gaps between computer vision research and games engineering, with the motive of bringing together the games research community and the computer vision community that have largely operated independently until now. We are inviting papers for two main tracks.

  23. (PDF) Computer Vision Techniques in Manufacturing

    Statistics of keywords of different manufacturing stages in computer vision research papers from 1970 to 2020. ... 2 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS: SYSTEMS.

  24. computer vision Latest Research Papers

    computer vision Latest Research Papers | ScienceGate computer vision Recently Published Documents TOTAL DOCUMENTS 11111 (FIVE YEARS 4372) H-INDEX 109 (FIVE YEARS 19) Latest Documents Most Cited Documents Contributed Authors Related Sources Related Keywords 2D Computer Vision 10.1142/12497 2022 Author (s): Yu-Jin Zhang Keyword (s):