Tag: Google

  • Google Releases Real-time Hand Tracking

    Google Releases Real-time Hand Tracking

    In a recent web-article on Road to VR: Google hand tracking, I found out about a technology Google has just made available to the public called, BlazePalm, which is part of their MediaPipe, an open source cross-platform framework for developers looking to build processing pipelines to handle perceptual data, like video and audio. BlazePalm is a new approach to hand perception which provides high-fidelity hand and finger tracking via machine learning, which can infer 21 3D ‘keypoints’ of a hand from just a single frame.

    21 Key Point Tracking

    Google Research engineers Valentin Bazarevsky and Fan Zhang describe their work in their full blogpost . While I have not read the full blogpost myself, Bazarevsky and Zhang describe, “Whereas current state-of-the-art approaches rely primarily on powerful desktop environments for inference, our method achieves real-time performance on a mobile phone, and even scales to multiple hands.” A few of the major points from the blogpost are:

    • The BlazePalm technique is touted to achieve an average precision of 95.7{76c5cb8798b4dc9652375d1c19c86d53c1d1411f4e030dd406aa284e63c21817} in palm detection, researchers claim.
    • The model learns a consistent internal hand pose representation and is robust even to partially visible hands and self-occlusions.
    • The existing pipeline supports counting gestures from multiple cultures, e.g. American, European, and Chinese, and various hand signs including “Thumb up”, closed fist, “OK”, “Rock”, and “Spiderman”.
    • Google is open sourcing its hand tracking and gesture recognition pipeline in the MediaPipe framework, accompanied with the relevant end-to-end usage scenario and source code, here.
    Google Hand Tracking Indoors

    While this is an interesting technology geared towards the mobile community, I feel it has far greater impact into the AR/VR/MR communities as well. First, this looks like some pretty decent hand recognition technology being made available to the entire community, free of charge. While this may have deep financial impact to companies such as Leap Motion, I feel it democratizes the hand recognition world. While I will need to reserve judgement until I see the technology in action but this may have even better performance than Leap Motion and without the purchase of expensive external hardware.

    Google Hand Tracking Outdoors

    Being able to implement hand tracking into a VR experience today was only an expensive pipe dream. But now, with a bit of time and ingenuity, this technology can be brought to real-time VR without expensive hardware and software. As the researchers continue to work on refining and tightening their technology, I can’t help but wonder if other tracking barriers, such as facial tracking, are also in the verge of crumbling!

    Multiple Hand Tracking
  • Google VR 101

    This article about Google’s VR 101 seems like something to check out.

  • Google Project Soli

    Now here is some tech that has real potential. This article from Road to VR, “Google’s Project Soli,” has some real potential. By harnessing radar in the 21st centuray, researchers are capturing very small hand gestures.

    While this technology has some super dramatic implications with virtual VR UI I’m surprised no one has taken this idea and pushed it someplace else. Why have they not explored facial recognition. From what it sounds, if the specs being claimed are true, then something like lip sync and facial recognition seems immanent. However, maybe the sounds waves coming from our mouths may mess with the radar. I don’t know.

     

  • Google Attempting VR World Domination

    A couple of articles today track the behavior of Google.

    It would seem the media giant is assembling a team to create its own suite of VR tools. This is no doubt an attempt to achieve VR domination. Who can blame them. The wave is crashing hard. The big boys are getting out their big guns to claim big territories in the new frontier.

    Adudio will play a huge role in the achievement of immersive theme-worlds and google want s to be the pack leader by setting its own VR audio standards.

    Google also seems to be designing its own VR engine. This is a no-brainer. It has been clearly evident for quite a while the mechanics for immersive theme-wolds will be more demanding and more varied than thos supplied by modern game engines. MOst game engine will provide a launch pad for a broader, more diversified VR engine.

    They are also building a VR camera from scratch.  I’m not to sure I understand the motivation for this one. There are already many players in this field. What could google gain by building a specialized and dedicated camera? As technology continues to roll out we’ll have to see how things are inter-related.