Introduction about Technologies

Computer vision is a field of artificial intelligence that trains computers to interpret and understand the visual world. Using digital images from cameras and videos and deep learning models, machines can accurately identify and classify objects — and then react to what they “see.”

OpenCV (Open Source Computer Vision Library) is a library of programming functions mainly aimed at real-time computer vision. Originally developed by Intel, it was later supported by Willow Garage then Itseez. The library is cross-platform and free for use under the open-source BSD license.

MediaPipe offers open source cross-platform, customizable ML solutions for live and streaming media.

How it works

Here we can control moves car game when yellow color come to;

  • right box (press key ‘d’)
  • left box (press key ‘a’)
  • left hand when thumb finger open (press key ‘w’)
  • right hand when thumb finger open (press key ‘s’)

first

Technologies & Libraries

  • opencv-python
  • mediapipe
  • pydirectinput
  • numpy

Download and Setup

Cloning the repo.

https://github.com/0xSE/steering-wheel-opencv-mediapipe.git

Setup

Running script-Setup.py file For windows users use:

python script-Setup.py

For linux or macOS use:

python3 script-Setup.py

Note

When install pydirectinput
Go to libaray file and delete all lines have "sleep"

Script still have some bugs

GitHub

View Github