• darhodester an hour ago

    Hi,

    I'm David Rhodes, Co-founder of CG Nomads, developer of GSOPs (Gaussian Splatting Operators) for SideFX Houdini. GSOPs was used in combination with OTOY OctaneRender to produce this music video.

    If you're interested in the technology and its capabilities, learn more at https://www.cgnomads.com/ or AMA.

    Try GSOPs yourself: https://github.com/cgnomads/GSOPs (example content included).

    • sbierwagen an hour ago

      From the article:

      >Evercoast deployed a 56 camera RGB-D array

      Do you know which depth cameras they used?

      • darhodester an hour ago
        • secretsatan 35 minutes ago

          Couldn’t you just use iphone pros for this? I developed an app specifically for photogrammetry capture using AR and the depth sensor as it seemed like a cheap alternative.

          EDIT: I realize a phone is not on the same level as a red camera, but i just saw iphones as a massively cheaper option to alternatives in the field i worked in.

          • F7F7F7 27 minutes ago

            ASAP Rocky has a fervent fanbase who's been anticipating this album. So I'm assuming that whatever record label he's signed to gave him the budget.

            And when I think back to another iconic hip hop (iconic that genre) video where they used practical effects and military helicopters chasing speedboats in the waters off of Santa Monica...I bet they had change to spear.

            • darhodester 8 minutes ago

              I think it's because they already had proven capture hardware, harvest, and processing workflows.

              But yes, you can easily use iPhones for this now.

              • fastasucan 13 minutes ago

                Why would they go for the cheapest option?

              • darhodester an hour ago

                I was not involved in the capture process with Evercoast, but I may have heard somewhere they used RealSense cameras.

                I recommend asking https://www.linkedin.com/in/benschwartzxr/ for accuracy.

                • brcmthrowaway an hour ago

                  Kinect Azure

              • nodra 2 hours ago

                Never did I think I would ever see anything close to related to A$AP on HN. I love this place.

                • keiferski an hour ago

                  Hah, for the past day, I've been trying to somehow submit the Helicopter music video / album as a whole to HN. Glad someone figured out the angle was Gaussian.

                  • wahnfrieden an hour ago

                    And nearly a Carti post at the top of HN

                    • rw_panic0_0 23 minutes ago

                      I'm taking the opportunity to FWAEH in here

                    • stickfigure an hour ago

                      Is he wearing... hair curlers?

                      • nodra an hour ago

                        That's what one does when they want some fiyah curls.

                    • pleurotus 2 hours ago

                      Super cool to read but can someone eli5 what Gaussian splatting is (and/or radiance fields?) specifically to how the article talks about it finally being "mature enough"? What's changed that this is now possible?

                      • tel an hour ago

                        Gaussian splatting is a way to record 3-dimensional video. You capture a scene from many angles simultaneously and then combine all of those into a single representation. Ideally, that representation is good enough that you can then, post-production, simulate camera angles you didn't originally record.

                        For example, the camera orbits around the performers in this music video are difficult to imagine in real space. Even if you could pull it off using robotic motion control arms, it would require that the entire choreography is fixed in place before filming. This video clearly takes advantage of being able to direct whatever camera motion the artist wanted in the 3d virtual space of the final composed scene.

                        To do this, the representation needs to estimate the radiance field, i.e. the amount and color of light visible at every point in your 3d volume, viewed from every angle. It's not possible to do this at high resolution by breaking that space up into voxels, those scale badly, O(n^3). You could attempt to guess at some mesh geometry and paint textures on to it compatible with the camera views, but that's difficult to automate.

                        Gaussian splatting estimates these radiance fields by assuming that the radiance is build from millions of fuzzy, colored balls positioned, stretched, and rotated in space. These are the Gaussian splats.

                        Once you have that representation, constructing a novel camera angle is as simple as positioning and angling your virtual camera and then recording the colors and positions of all the splats that are visible.

                        It turns out that this approach is pretty amenable to techniques similar to modern deep learning. You basically train the positions/shapes/rotations of the splats via gradient descent. It's mostly been explored in research labs but lately production-oriented tools have been built for popular 3d motion graphics tools like Houdini, making it more available.

                        • dmarcos an hour ago

                          It’s a point cloud where each point is a semitransparent blob that can have a view dependent color: color changes depending on direction you look at them. Allowing to capture reflections, iridescence…

                          You generate the point clouds from multiple images of a scene or an object and some machine learning magic

                          • djeastm 2 hours ago

                            For the ELI5, Gaussian splatting represents the scene as millions of tiny, blurry colored blobs in 3D space and renders by quickly "splatting" them onto the screen, making it much faster than computing an image by querying a neural net model like radiance fields.

                            I'm not up on how things have changed recently

                            • rkuykendall-com 2 hours ago

                              I found this VFX breakdown of the recent Superman movie to have a great explanation of what it is and what it makes possible: https://youtu.be/eyAVWH61R8E?t=232

                              tl;dr eli5: Instead of capturing spots of color as they would appear to a camera, they capture spots of color and where they exist in the world. By combining multiple cameras doing this, you can make a 3D works from footage that you can then zoom a virtual camera round.

                              • arcfour an hour ago

                                So it's the 3D equivalent of a camera (using many cameras) that you can edit in 3D space too?

                            • badcryptobitch 4 minutes ago

                              Both of my worlds are colliding with this article. I love reading about how deeply technical products/artifacts get used in art.

                              • noman-land 2 hours ago

                                Really amazing video. Unfortunately this article is like 60% over my head. Regardless, I actually love reading jargon-filled statements like this that are totally normal to the initiated but are completely inscrutable to outsiders.

                                    "That data was then brought into Houdini, where the post production team used CG Nomads GSOPs for manipulation and sequencing, and OTOY’s OctaneRender for final rendering. Thanks to this combination, the production team was also able to relight the splats."
                                • darhodester an hour ago

                                  Hi, I'm one of the creators of GSOPs for SideFX Houdini.

                                  The gist is that Gaussian splats can replicate reality quite effectively with many 3D ellipsoids (stored as a type of point cloud). Houdini is software that excels at manipulating vast numbers of points, and renderers (such as Octane) can now leverage this type of data to integrate with traditional computer graphics primitives, lights, and techniques.

                                  • suzzer99 an hour ago

                                    Can you put "Gaussing splats" in some kind of real world metaphor so I can understand what it means? Either that or explain why "Gaussian" and why "splat".

                                    I am vaguely aware of stuff like Gaussian blur on Photoshop. But I never really knew what it does.

                                    • darhodester an hour ago

                                      Sure!

                                      Gaussian splatting is a bit like photogrammetry. That is, you can record video or take photos of an object or environment from many angles and reproduce it in 3D. Gaussians have the capability to "fade" their opacity based on a Gaussian distribution. This allows them to blend together in a seamless fashion.

                                      The splatting process is achieved by using gradient descent from each camera/image pair to optimize these ellipsoids (Gaussians) such that the reproduce the original inputs as closely as possible. Given enough imagery and sufficient camera alignment, performed using Structure from Motion, you can faithfully reproduce the entire space.

                                      Read more here: https://towardsdatascience.com/a-comprehensive-overview-of-g....

                                      • shwaj 41 minutes ago

                                        How can you expect someone to tailor a custom explanation, when they don’t know your level of mathematical understanding, or even your level of curiosity. You don’t know what a Gaussian blur does; do you know what a Gaussian is? How deeply do you want to understand?

                                        If you’re curious start with the Wikipedia article and use an LLM to help you understand the parts that don’t make sense. Or just ask the LLM to provide a summary at the desired level of detail.

                                  • pants2 an hour ago

                                    Corridor has done some great stuff with Gaussian Splats, I recommend this video for a primer!

                                    https://youtube.com/watch?v=cetf0qTZ04Y

                                  • rjh29 2 hours ago

                                    To be honest it looks like it was rendered in an old version of Unreal Engine. That may be an intentional choice - I wonder how realistic guassian splatting can look? Can you redo lights, shadows, remove or move parts of the scene, while preserving the original fidelity and realism?

                                    The way TV/movie production is going (record 100s of hours of footage from multiple angles and edit it all in post) I wonder if this is the end state. Gaussian splatting for the humans and green screens for the rest?

                                    • TeMPOraL 3 minutes ago

                                      For me it felt more like higher detail version of Teardown, the voxel-based 3d demolition game. Sure it's splats and not voxels, but the camera and the lighting give this strong voxel game vibe.

                                      • darhodester an hour ago

                                        The aesthetic here is at least partially an intentional choice to lean into the artifacts produced by Gaussian splatting, particularly dynamic (4DGS) splatting. There is temporal inconsistency when capturing performances like this, which are exacerbated by relighting.

                                        That said, the technology is rapidly advancing and this type of volumetric capture is definitely sticking around.

                                        The quality can also be really good, especially for static environments: https://www.linkedin.com/posts/christoph-schindelar-79515351....

                                        • F7F7F7 18 minutes ago

                                          Knowing what I know about the artist in this video this was probably more about the novelty of the technology and the creative freedom it offers rather than it is budget.

                                          • moi2388 an hour ago

                                            Yes, they talk about this in the article and that’s exactly what they did.

                                          • daveofiveo 2 hours ago

                                            Direct link to the music video: https://www.youtube.com/watch?v=g1-46Nu3HxQ

                                            • dang 37 minutes ago

                                              Good idea - we'll put that link in the toptext as well. Thanks!

                                            • roughly 2 hours ago

                                              Be sure to watch the video itself* - it’s really a great piece of work. The energy is frenetic and it’s got this beautiful balance of surrealism from the effects and groundedness from the human performances.

                                              * (Mute it if you don’t like the music, just like the rest of us will if you complain about the music)

                                              • filoleg 38 minutes ago

                                                Similarly, the music video for Taylor Swif[0] (another track by A$AP Rocky) is just as surrealistic and weird in the best way possible, but with an eastern european flavor of it (which is obviously intentional and makes sense, given the filming location and being very on-the-nose with the theme).

                                                0. https://youtu.be/5URefVYaJrA

                                              • narrator an hour ago

                                                This reminds me about how Soulja Boy just used a cracked copy of Fruity Loops and a cheap microphone and recorded all his songs that made him millions.[1] No big studio, or anything much required like the pre-digital music days. Now we got the same thing for music videos and soon movies. The people with no money who make culture are going to be some of the biggest beneficiaries from AI.

                                                [1] https://www.youtube.com/watch?v=f1rjhVe59ek

                                                • enneff 24 minutes ago

                                                  You might consider why this article which has nothing to do with AI as you know it (except for the machine learning aspects of Gaussian splatting), and was produced by a huge team of vfx professionals, has made you think about AI democratising culture (despite the fact that music videos and films have been cheap to make for decades). Don’t just look for opportunities to discuss your favourite talking points.

                                                  • narrator 21 minutes ago

                                                    Fair point actually. touché.

                                                  • comradesmith 40 minutes ago

                                                    I really don’t see the connection. A$AP isn’t a noob

                                                    • narrator 31 minutes ago

                                                      I bet this video was produced for dramatically much less than the CGI budget for something like this would have cost just a few years ago. In the early 90s, digital music production started with early DATs and MIDI setups and the prices just fell to almost nothing by the 2000s. We're probably going to see the same sort of thing going on with splatting and movie tech till it will be DIY on a last gen laptop.

                                                      A different example on rapid low budget media creation technical progress was the change in production values with some vloggers that have been around for a while like SVDelos. They started filming their sail around the world on 640x480 video in the 2000s. They've been on the ocean for more than a decade now and progressively added 4k video, drones, much better editing, etc. The impact of drones on making shots of rarely visited uninhabited islands in the South Pacific is particularly dramatic as it would have been extremely expensive to do those shots with previous aerial photography techniques.

                                                  • jtolmar 42 minutes ago

                                                    Dang, it's been cool watching gaussian splats go from tech demo to real workflow.

                                                    • darhodester 6 minutes ago

                                                      For sure!

                                                    • drdirk 2 hours ago

                                                      Can somebody explain to me what was actually scanned? Only the actors doing movements like push ups, or whole scenes / rooms?

                                                      • moribvndvs an hour ago

                                                        In another setting, it looks like ass, but lo-fi, glitchy shit is perfectly compatible with hip-hop aesthetic. Good track though.

                                                        • henshao 33 minutes ago

                                                          I think in 2026 it's hard to make a video look this "bad" without it being a clear aesthetic choice, so not sure you could find this video in another setting.

                                                          • ungreased0675 25 minutes ago

                                                            The technology is impressive, but the end result… Weapons-grade brainrot.

                                                            I’m curious what other artists end up making with it.

                                                            • F7F7F7 14 minutes ago

                                                              The article said it was by design...

                                                          • weinzierl 25 minutes ago

                                                            "The team also used Blender heavily for layout and previs, converting splat sequences into lightweight proxy caches for scene planning."

                                                            • sneak an hour ago

                                                              > One recurring reaction to the video has been confusion. Viewers assume the imagery is AI-generated. According to Evercoast, that couldn’t be further from the truth. Every stunt, every swing, every fall was physically performed and captured in real space. What makes it feel synthetic is the freedom volumetric capture affords.

                                                              No, it’s simply the framerate.

                                                              • yieldcrv 2 hours ago

                                                                so basically despite the higher resource requirements like 10TB of data for 30 minutes of footage, the compositing is so much faster and more flexible and those resources can be deleted or moved to long term storage in the cloud very quickly and the project can move on

                                                                fascinating

                                                                I wouldn't have normally read this and watched the video, but my Claude sessions were already executing a plan

                                                                the tl;dr is that all the actors were scanned in a 3D point cloud system and then "NeRF"'d which means to extrapolate any missing data about their transposed 3D model

                                                                this was then more easily placed into the video than trying to compose and place 2D actors layer by layer

                                                                • darhodester an hour ago

                                                                  Gaussian splatting is not NeRF (neural radiance field), but it is a type of radiance field, and supports novel view synthesis. The difference is in an explicit point cloud representation (Gaussian splatting), versus a process that needs to be inferred by a neural network.

                                                                  • andybak an hour ago

                                                                    > and then "NeRF"'d which means to extrapolate any missing data about their transposed 3D model

                                                                    Not sure if it's you or the original article but that's a slightly misleading summary of NeRFs.

                                                                    • yieldcrv an hour ago

                                                                      I'm all for the better summary

                                                                  • MuffinFlavored an hour ago

                                                                    How did Rhianna look him in the eyes and say "yes babe, good album, release it, this is what the people wanted after 7 years, it is pleasing to listen to and enjoyable"?

                                                                    • delbronski 9 minutes ago

                                                                      Im sure it was more like, “hey babe, can I get a few millions to go in the studio and experiment/make some art?” And then she was like, “yeah go for it! Make some weird shit.”

                                                                      If I was in his position I’d probably be doing the same. Why bother with another top hit that pleases the masses.

                                                                      • larsmaxfield an hour ago

                                                                        I prefer when artists make music they intrinsically want to make — not what others want them to make.

                                                                        • b00ty4breakfast an hour ago

                                                                          the real question is how much of the art is their own and how much is outside expectations and their reactions to it.

                                                                          And it's not always giving in to those voices, sometimes it's going in the opposite direction specifically to subvert those voices and expectations even if that ends up going against your initial instincts as an artist.

                                                                          With someone like A$AP Rocky, there is a lot of money on the line wrt the record execs but even small indie artists playing to only a hundred people a night have to contend with audience expectation and how that can exert an influence on their creativity.

                                                                        • perfmode 21 minutes ago

                                                                          It seems the numerous leaks and trials took their toll.

                                                                          I don’t disagree with you—I felt “Tailor Swif,” “DMB,” and “Both Eyes Closed” were all stronger than the tracks that made it onto this album.

                                                                          But sometimes you’ve gotta ship the project in the state it’s in and move on with your life.

                                                                          Maybe now he can move forward and start working on something new. And perhaps that project will be stronger.

                                                                          • weakfish 20 minutes ago

                                                                            Because it was awesome. But also, leaks probably.

                                                                          • londons_explore 2 hours ago

                                                                            Pretty sure most of this could be filmed with a camera drone and preprogrammed flight path...

                                                                            Did the Gaussian splatting actually make it any cheaper? Especially considering that it needed 50+ fixed camera angles to splat properly, and extensive post-processing work both computationally and human labour, a camera drone just seems easier.

                                                                            • hamburglar an hour ago

                                                                              > Pretty sure most of this could be filmed with a camera drone and preprogrammed flight path

                                                                              This is a “Dropbox is just ftp and rsync” level comment. There’s a shot in there where Rocky is sitting on top of the spinning blades of a helicopter and the camera smoothly transitions from flying around the room to solidly rotating along with the blades, so it’s fixed relative to rocky. Not only would programming a camera drone to follow this path be extremely difficult (and wouldn’t look as good), but just setting up the stunt would be cost prohibitive.

                                                                              This is just one example of the hundreds you could come up with.

                                                                              • nebezb an hour ago

                                                                                If it was achievable, cheaper, and of equal quality then it would have been done that way. Surely it would’ve been done that way a long time ago too. Drone paths have been around a lot longer than this technology.

                                                                                There’s no proof of your claim and this video is proof of the opposite.

                                                                                • larsmaxfield an hour ago

                                                                                  Flying a camera drone with such proximity and acceleration would be a safety nightmare.

                                                                                  • darhodester an hour ago

                                                                                    A drone path would not allow for such seamless transitions, never mind the planning required to nail all that choreography, effects, etc.

                                                                                    This approach is 100% flexible, and I'm sure at least part of the magic came from the process of play and experimentation in post.

                                                                                    • ex-aws-dude an hour ago

                                                                                      I think you’re missing the point

                                                                                      Volumetric capture like this allows you to decide on the camera angles in post-production

                                                                                      • F7F7F7 15 minutes ago

                                                                                        This might be the first time I'm stumbling on Dunning Kruger on HN, no offense.

                                                                                        • ThouYS 2 hours ago

                                                                                          it gives you flexibility, options

                                                                                          • echelon 2 hours ago

                                                                                            It's fucking cool. That's why.

                                                                                            This tech is moving along at breakneck pace and now we're all talking about it. A drone video wouldn't have done that.