/virtual-window

My BSc final project; a proof-of-concept augmented reality software that tracks the head of the user via an ordinary laptop camera, projects the 3D scene behind accordingly onto the screen, and creates the illusion of a window to that 3D world behind.

Primary LanguageC++MIT LicenseMIT

Virtual Window project is about making the 3D computer graphics applications more realisticly three dimensional. It demonstrates the idea of projecting the 3D world behind onto screen with respect to the viewer's position, on a Universal Windows Platform application written in C++ using DirectX 11 and DirectXTK.

Before I had started to this project, my aim was to turn my computer screen into a window to the virtual world behind, hence I chose the name Virtual Window for the project.

Overview

Today's computer screens are usually flat, and limited to displaying us a plane of image. However, with just a simple 3D perspective projection of a modelled 3D world, we can create the illusion of a 3D world behind the screen.

On the other hand, we can easily see through this illusion, and find out that what we see is just a projection. How? Try:

  1. closing one of our eyes and then the other, or
  2. moving our heads up and down, left and right.

What you see on your computer screen will likely not change when you do those. There are some technologies that allows you to see different images with your both eyes, some of them even without special glasses. With or without glasses, they always require a special screen, but then make it possible for an application to pass the first test. This project is not about the first test.

Virtual Window aims to let the 3D applications pass the second test. The world behind is projected with respect to the viewer's position, so it becomes harder for the viewer to realize that it is just a projection, and the experience is more realistic and immersive. Finally, we can and we do this only by using a single front-facing camera.

Motivation

This idea to make the 3D projection responsive to my movements against the screen as a viewer came to me when I was taking the introductory course to computer graphics. I grew more passionate about the idea as I remembered how I was trying to see down below by moving my head up close to the screen when I was playing an old 3D game as a child.

I was wondering how come nobody had ever thought of this before, but also was sure that nobody hasn't, since otherwise there would definitely be at least some games or applications that I would have heard of using this technique to provide more immersive experience to their users. Thinking that I was onto something novel, I had worked on this project fully motivated for weeks.

I am glad that it was after I have finished preparing my demonstrating application, when I found out that other people had already done this before. It was still disappointing, but would also be demotivating if I were to discover this when my project was half-done.

Shortcomings

There are a couple of device-dependent parameters that I have hard-coded, which are the major shortcomings of this application. Those are:

  1. the focal length, and
  2. the position of the camera.

I have configured them to work with a Surface Pro 4. On top of those, it also has been calibrated to my head size. The difference in the head size should not be that so significant, however.

To change the application to suit your device, adjust the imagePlaneDistance parameter in the EyeDetector::UpdateEyePosition(void) within EyeDetector.cpp. The parameter is not immediately the focal length in any dimensions, but rather a convenient measure. Its decrease will make the application more sensitive to your head movements along the parallel plane, and the opposite will happen as it increases. Change and adjust it accordingly.

If the camera of your device is somewhere different than the top-center of your screen, then you will also have to make a small change for that. Inside MoveEye function within Sample3DSceneRenderer.cpp, you will have to change the line where I increase the Y by half the hundredth of of outputSize.Height. Depending on where your camera is, increase or decrease the X and/or Y by a fraction of the hundredth of outputSize.Width and/or outputSize.Height, respectively. For example, if your camera is at the bottom right corner of your screen, you should have something like the following:

// Compensation for camera position
X += outputSize.Width  / 200;
Y -= outputSize.Height / 200;

If you think that there is a problem with the head size assumption, you can play with the distanceFactor right above the imagePlaneDistance, which we have previously adjusted for the focal length.