It Took Me 30 Years to Solve This VFX Problem – Green Screen Problem [video]

[−] Springtime 60d ago

In an earlier video they made a couple years back about Disney's sodium vapor technique Paul Debevec suggested he was considering creating a dataset using a similar premise: filming enough perfectly masked references to be able to train models to achieve better keying. So it was interesting seeing Corridor tackle this by instead using synthetic data.

[−] somat 60d ago

With regards to the sodium vapor process, an idea has been percolating in the back of my head ever since I saw that video. But I don't really have the budget to try it out.

theory: make the mask out of non-visable light

illuminate the backing screen in near Infra-Red light. (after a bit of thought I chose near-IR as opposed to near-UV for hopefully obvious reasons)

point two cameras at a splitting prism with a near IR pass filter(I have confirmed that such thing exists and is commercially available)

Leave the 90 degree(unaltered path) camera untouched, this is the visible camera.

Remove the IR filter from the 180 degree(filter path) camera, this is the mask camera.

Now you get a perfect non-color shifting mask(in theory), The splitting prism would hurt light intake. It might be worth it to try putting the cameras really close together , pointed same direction, no prism, and see if that is close enough.

[−] diacritical 60d ago

From ~04:10 till 05:00 they talk about sodium-vapor lights and how Disney has the exclusive rights to use it. From what I read the knowledge on how to make them is a trade secret, so it's not patented. Seems weird that it would be hard to recreate something from the 1950's.

I also wonder how many hours were wasted by people who had to use inferior technology because Disney kept it secret. Cutting out animals and objects from the background 1 frame at a time seems so mindnumbingly boring.

[−] jayd16 60d ago

As far as alternatives, I wonder if anyone has tried a screen that cycles through colors in a known sequence. Using this modulating-color screen, it might actually be easier to separate the subject because you get around the "green shirt over green screen" problem. You might even be able to use a time sampling to correct the light cast on the subject from the screen as you would have a full spectrum of response.

I could also imagine using polarized light as the backdrop as well.

[−] lynnharry 59d ago

It’s fascinating to see the bridge between academic research and industry application here. While Image Matting is a massive research area in Computer Vision, academia often focuses on solving perfect 'benchmarks.' Corridor Crew effectively took that foundational research, like neural unmixing and synthetic training, and adapted it to solve the 'messy' reality of production, like tracking markers and motion blur. It’s a great example of using open-source deep learning resources to build a tool that prioritized workflow over just a high accuracy score.

[−] IshKebab 60d ago

Pretty impressive results! Seems like someone has even made a GUI for it: https://github.com/edenaion/EZ-CorridorKey

Still Python unfortunately.

[−] swframe2 60d ago

The model in this repo seems pretty good: https://github.com/xuebinqin/DIS

[−] vsviridov 60d ago

The community has managed to drastically lower hardware requirements, but so far I think only Nvidia cards are supported, so as an AMD owner I'm still missing out :(

[−] superjan 60d ago

Watched this a few days ago. The video is light on technical details, except maybe that they used CGI to generate training data.

[−] mk_stjames 59d ago

In case anyone wanted technical details of the NN, I dug into the repo:

Its a transformer, with a CNN refiner after. Specifically, a ViT using the Hiera architecture (https://github.com/facebookresearch/hiera)

The Hiera ViT has dual decoder heads, one for the alpha and one for the RGD foreground, and then a small CNN refiner network to solve some artifacting in the output from the Hiera model.

I'd be very interested to see a long form tech talk of Niko explaining his process of learning ML ropes and building this model.

[−] comex 60d ago

See also this video comparing Corridor Key to traditional keyers:

https://www.youtube.com/watch?v=abNygtFqYR8

[−] amelius 60d ago

There's still a bug: the glass with water does not distort the checker pattern in the background at 24:12.

[−] amelius 60d ago

Is it a coincidence that the result is stable between subsequent frames?

[−] jcmoscon 59d ago

It is refreshing to see problems being solved by AI that are not LLMs. There are so many day-to-day challenges that we could solve using data, machine learn and some creativity.

[−] ralusek 60d ago

I'm a software engineer that, like the vast majority of you, uses AI/agents in my workflow every day. That being said, I have to admit that it feels a little weird to hear someone who does not write code say that they built something, without even mentioning that they had an agent build it (unless I missed that).

[−] qingcharles 60d ago

I use the Adobe version of this in Photoshop every day and I assumed that Adobe solved this the same way, but used professionals to cut out the subjects from the backgrounds then fed both versions into their AI.

Since they added it a year or so ago it has been game-changing. I'm cutting out portraits every day and having a magical tool that cuts out the subject with perfect hair cut out with a single click is sci-fi.

Here's a demo of Photoshop's tool:

https://www.youtube.com/watch?v=SNVJN6PKeGQ

(the other magical Photoshop tool is the one that removes reflections from windows, which is even more insane when you reverse it and tell it you only want the reflection and not what's on the other side of the glass)

[−] Computer0 60d ago

Looking forward to trying it out, 8gb of vram or unified memory required!

It Took Me 30 Years to Solve This VFX Problem – Green Screen Problem [video] (youtube.com)

113 comments