Faster LCAO two-center integrals
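The LCAO overlap code performs a large number of additions, multiplications and similar operations on quite small matrices. This is time consuming because Python adds a lot of per-operation overhead, which is large compared with the arithmetic itself when the matrices are this small. Rewrite the most performance-critical parts in C.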
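
As a rough illustration of the idea (not the actual LCAO code), the sketch below moves a loop over many small matrix products into a single C function, so the interpreter overhead is paid once per batch rather than once per product. The function name small_matmul_accumulate, its argument layout, and the assumption of contiguous row-major double arrays are all hypothetical and chosen only for this example.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical kernel, for illustration only.
 * For each of npairs blocks p, accumulate C[p] += A[p] * B[p], where
 * A[p] is m x k, B[p] is k x n and C[p] is m x n. All blocks are assumed
 * to be stored back to back in contiguous row-major order. */
void small_matmul_accumulate(size_t npairs, size_t m, size_t k, size_t n,
                             const double *a, const double *b, double *c)
{
    assert(a != NULL && b != NULL && c != NULL);

    for (size_t p = 0; p < npairs; p++) {
        /* Pointers to the start of the p-th block in each array. */
        const double *ap = a + p * m * k;
        const double *bp = b + p * k * n;
        double *cp = c + p * m * n;

        for (size_t i = 0; i < m; i++) {
            for (size_t l = 0; l < k; l++) {
                double ail = ap[i * k + l];
                /* Innermost loop runs along a row of B and C. */
                for (size_t j = 0; j < n; j++)
                    cp[i * n + j] += ail * bp[l * n + j];
            }
        }
    }
}
```

A function like this could be called once from Python with whole arrays of blocks; how it would be exposed to Python (extension module, ctypes, or similar) is left open here.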