This is totally amazing:As someone who works on manufacturing automation with robotics and vision, I can say this is a very complicated task since the robot has to distinguish by a 2D image between the right crop and weeds. Also the laser shooting of the weeds is awesome!
Saturday, August 28, 2021
Saturday, August 21, 2021
AI21 is a research lab is the Israeli equivalent OpenAI, founded by several machine learning luminaries including Prof. Amnon Shashua (MobileEye, Orcam, Digial Bank) who is a Prof at the Hebrew University. (Amnon was my lecturer for the ML course, which was an amazing course and he is an amazing person as well).
This week AI21 announced the release of the largest NLP model called Jurassic-1. It is a comparable model to GPT-3. The is no objective evaluation of the two models, but AI21 mentions that the number of word tokens used to train the models is 250K (compared to around 50K of GPT-3) which gives more flexibility in answering questions regarding common phrases, named entities etc. A great tutorial for GPT-3 is given in Yannic's Youtube Channel:
Building such a large NLP model is challenging, since the model has around 170B parameters and you need weeks of training with hundreds of GPUs, a cost that typically only the biggest companies can afford. Another interesting company I recently met is LightOn which builds photon based hardware to training language models, they recently announced the largest French based model.
It will be interesting to see when AI21 and similar companies will move to training non-English corpuses which is the place such companies can shine.
An interesting conference coming up soon is the NLP Summit (An online event Oct 5-7).
Saturday, August 7, 2021
I recently stumbled upon Yannic's Youtube Channel and I was totally blown away. Yannic is a fresh PhD out of ETH Zurich and he has few dozens of recent deep learning papers explained amazingly well. Both the selection of papers is smart, as well as the explanation of the content. In addition for some of the papers he adds personal comments and critics about the papers claims which really make sense. The audience for those tutorials is advanced deep learning audience and they cover advanced topics which Coursera courses mostly did not catch up yet. For example great coverage of transformers for both language and image models.
According to his LinkedIn, Yannic recently started a company along with 3 other ETH PhDs called DeepJudge which deploys deep learning NLP models in the legal domain. The company is 4 months old and according to CrunchBase raised a small seed round.
Based on the brains of the DeepJudge team, I call all the VCs, headhunters, university recruiters and everyone else to wake up! I am pretty sure we will here a lot of those guys.