Vision and Image Course Project_综合

COURSE PROJECT - CS5187 VISION AND IMAGE

项目要求

1. EXPECTATION

The course project can be in group or individual. Through the project, you are expected to show knowledge (B grade), technical capability (B to A grade) and creativity (A grade) in any topic related to computer vision and/or image processing.

2. GROUPING

Each group can have at most three members. It is OK to work alone. The grades of group members are not necessarily the same depending on contribution. The expectation for the scope of a project, as well as the grade, will be proportional to the group size.

3. SCOPE

No two groups can have the exactly same title and content. All topics need to be approved by Prof. Ngo. If a proposed topic is overlapped with another group, the approval will be based on First-Come-First-Approve. The scope of a project can be in one (or multiple) of the following formats:

Algorithm Implementation and Result Comparison

Implement at least two algorithms/techniques (learn in class or elsewhere) and conduct empirical evaluation for performance comparison.

Proof-of-concept

You can propose a new algorithm/technique and show the improvement in theory and/or by experiment. You may or may not implement the algorithm, but a complete description of proposal, algorithm analysis and expected result is required.

Application of algorithms/techniques for innovative application

You can implement a vision or image related system to demonstrate your innovation. You are required to showcase the system by the end of project.

Critical survey

A survey of at least five research papers for a specific problem of computer vision and/or image processing. The survey must include critical analysis of strength and limitation of different techniques.

4. SCHEDULE

5. SUGGESTED TOPICS

- Image segmentation

- 3D estimation from image sequence

- 3D volume estimation

- Application using dual-camera mobile phone (e.g., using ARKit in iphone)

- Image or video retrieval

- Image or video captioning

- X detection and tracking (X = face, gesture or pose, vehicle, pedestrian)

- X recognition for interactivity (X = face, gesture or pose)

- Facial expression recognition

- X recognition with deep learning (X = object, scene, activity, 3D model)

- Seam carving

- Domain specific image processing (medicine, food, traffic, cultural heritage)

- Person re-identification in multi-camera setting

- Video surveillance

- Image editing with computer vision techniques

- Multi-sensor (one of the sensors must be camera) analysis or application

- Super-resolution image reconstruction

2-15开会：确定题目 Faical Expression Recognition

https://www.kaggle.com/c/challenges-in-representation-learning-facial-expression-recognition-challenge/leaderboard

https://github.com/amineHorseman/facial-expression-recognition-using-cnn#fer2013

https://github.com/search?q=Facial+Expression

2-22开会：

下周安排

每个人在github上找合适的2013kaggle比赛相关的解决方法（自己本地跑通）

确定题目后编辑email并发送，安排每个人的role

3-15开会：

1.确定自己要做哪个项目

2.看懂项目代码逻辑，理解模型(CNN/VGG)的原理

3.写一个文档大致介绍一下

Dongyu Zhao https://github.com/WuJie1010/Facial-Expression-Recognition.Pytorch

ShaoHeng He https://github.com/oarriaga/face_classification

Kejun Du 待定