Added the small weight embedding + id layer norm inits.