Introduction to the Technologies

Computer vision is a field of artificial intelligence that trains computers to interpret and understand the visual world. Using digital images from cameras and videos and deep learning models, machines can accurately identify and classify objects — and then react to what they “see.”

OpenCV (Open Source Computer Vision Library) is a library of programming functions mainly aimed at real-time computer vision. Originally developed by Intel, it was later supported by Willow Garage and then Itseez. The library is cross-platform and free to use under an open-source license (BSD for older releases, Apache 2.0 from version 4.5.0 onward).

MediaPipe offers open-source, cross-platform, customizable ML solutions for live and streaming media.

How it works

The script controls a car game from the camera image. A key is pressed when:

  • a yellow object enters the right box (presses ‘d’)
  • a yellow object enters the left box (presses ‘a’)
  • the thumb of the left hand is open (presses ‘w’)
  • the thumb of the right hand is open (presses ‘s’)
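The four conditions above can be sketched in plain Python. This is only an illustration under stated assumptions, not the repository's actual code: the HSV window for "yellow", the `min_fraction` threshold, the thumb heuristic, and every function name here are hypothetical. The landmark indices (3 = thumb IP joint, 4 = thumb tip, 17 = pinky knuckle) follow the MediaPipe Hands ordering.

```python
import colorsys
import math

# Assumed HSV window for "yellow" (colorsys scales h, s and v to 0..1).
YELLOW_HUE = (0.11, 0.20)
MIN_SAT = 0.4
MIN_VAL = 0.4


def is_yellow(rgb):
    """Return True if an (r, g, b) pixel with 0-255 channels looks yellow."""
    r, g, b = (c / 255.0 for c in rgb)
    h, s, v = colorsys.rgb_to_hsv(r, g, b)
    return YELLOW_HUE[0] <= h <= YELLOW_HUE[1] and s >= MIN_SAT and v >= MIN_VAL


def box_has_yellow(frame, box, min_fraction=0.2):
    """frame: rows of (r, g, b) pixels; box: (x0, y0, x1, y1), end-exclusive."""
    x0, y0, x1, y1 = box
    pixels = [px for row in frame[y0:y1] for px in row[x0:x1]]
    if not pixels:
        return False
    hits = sum(1 for px in pixels if is_yellow(px))
    return hits / len(pixels) >= min_fraction


def thumb_open(landmarks):
    """landmarks: 21 (x, y) points in MediaPipe Hands order.

    Hypothetical heuristic: the thumb counts as open when its tip (4) is
    farther from the pinky knuckle (17) than the thumb IP joint (3) is.
    """
    tip, ip, pinky_mcp = landmarks[4], landmarks[3], landmarks[17]
    return math.dist(tip, pinky_mcp) > math.dist(ip, pinky_mcp)


def keys_to_press(frame, left_box, right_box, left_hand=None, right_hand=None):
    """Map the four conditions from the list above to key names."""
    keys = []
    if box_has_yellow(frame, right_box):
        keys.append("d")
    if box_has_yellow(frame, left_box):
        keys.append("a")
    if left_hand is not None and thumb_open(left_hand):
        keys.append("w")
    if right_hand is not None and thumb_open(right_hand):
        keys.append("s")
    return keys
```

In a real run the frame would come from OpenCV, the landmarks from MediaPipe, and each returned key name would be passed to pydirectinput (for example `pydirectinput.press`). The sketch leaves those parts out so the decision logic can be read and tested on its own.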


Technologies & Libraries

  • opencv-python
  • mediapipe
  • pydirectinput
  • numpy

Download and Setup

Cloning the repo.


Running the file. For Windows users, use:


For Linux or macOS, use:



After installing pydirectinput, open the library's source file and delete every line that contains "sleep".
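The manual edit above could also be scripted. The following is a minimal sketch of the filtering step only, assuming a plain substring match on "sleep" is what is wanted. `drop_sleep_lines` is a hypothetical helper, and the sample text is made up for illustration rather than taken from pydirectinput.

```python
def drop_sleep_lines(source):
    """Return source with every line that contains 'sleep' removed."""
    kept = [
        line
        for line in source.splitlines(keepends=True)
        if "sleep" not in line
    ]
    return "".join(kept)


# Made-up sample; `send` is a placeholder name and is never called here.
sample = (
    "import time\n"
    "def press(key):\n"
    "    send(key)\n"
    "    time.sleep(0.1)\n"
    "    return True\n"
)
print(drop_sleep_lines(sample))
```

For an installed copy, the module's path is normally available as `pydirectinput.__file__`. Keep a backup before overwriting the file: if a sleep call is the only statement in a block, deleting the line leaves invalid Python, and it would need to be replaced with `pass` instead.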

The script still has some bugs.

