If you're used to desktop systems it's worth noting that the VC4, in common with most other mobile GPUs, is tile based and this has performance implications. The
OpenGL Insights book has a chapter on performance tuning for tile based architectures (which can be downloaded from that link).