[IO] replace the std::mutex.try_lock with our own std::atomic based implementation
std::mutex uses pthread_mutex which does a lot of stuff we don't need.
This change makes no gigantic difference but I is measureable therefore I think it is worth it. Some test results from our echo server bennchamr big03->big02 try_lock.pdf