ShanmugaEE 645 3-D Computer Vision 2018-19
Course ContentReview of linear algebra, calculus of variations, signals and systems; Camera and image formation - optics; Feature detectors - edge and corner detection; Feature descriptors - SIFT, SURF, feature matching; Shape from X - Reflectance map, BRDF, shape from shading, photometric stereo, depth from defocus, depth from focus, RGB-D images; Single view geometry - finite projective cameras, camera parameters, point correspondences, estimation of camera matrix, direct linear transformation (DLT); Two view geometry - homography, epipolar geometry, estimation of fundamental matrix, image rectification, stereo correspondence, shape from stereo; Three view geometry - trifocal tensors; Motion - optical flow field, Estimation of dense and accurate optical flow field; Multi view geometry - structure from motion, triangulation, factorization, bundle adjustment; Internet vision - mining community photo collections (Flickr, Facebook, etc.). Textbooks
The first 3 books are the best ones to learn from while the next 4 books provide alternate treatment of certain topics. References
The book by Marr provides a viewpoint based on visual neuroscience concepts. The next 5 books can be used as reference for certain topics. Apart from these books, some topics would be taught from selected research papers. Lecture notes you make in the classroom will provide pointers to look into topics in different books listed above. The topics taught in a lecture may have evolved from multiple books and research papers. Reading books would certainly aid lectures but can never replace the lectures. Suggested Readings
These suggested readings supplement the textbooks and reference books to understand various mathematical concepts in depth. Grading
Expected Learning OutcomesThe world we live has three dimensions (3D). Human visual system has evolved to perceive all these dimensions. However, the images we capture using conventional cameras are just the 2D projections of the 3D world. In 3D Computer Vision course, we shall explore various techniques for recovering the missing third dimension (depth information) from 2D images using primarily variational methods and projective geometry concepts. The course contents would enable the student to reconstruct the 3D real world scene from 2D images by various methods. The applications of this course range from cultural heritage to medical imaging, from robot navigation to 3D modeling. The assignments and projects associated with the course to be completed using OpenCV-Python would enable students to develop state-of-the-art 3D computer vision applications. This course is offered as an elective for BTech, MTech, and PhD students of IIT Gandhinagar. This course is also prescribed for a minor degree in Computer Science. Contacting InstructorPrimary mode of contact will be to send an email and fix an appointment to meet. Queries may be posted in Google classroom. Similar Courses offered at Other Top UniversitiesLicenseCopyright © 2017 Shanmuga. |