Recommended! An interesting paper from Google and UCLA. Among other things, it presents an optimization algorithm that is simpler than the de facto defaults in use today, e.g. AdamW. Google already uses this algorithm in production.