The smallest useful stack is enough for most beginners: a way to create still visual direction, a way to turn that direction into motion, an editor to shape the sequence, and an audio layer to add voice, music, or atmosphere.
Until the workflow is clear, adding more tools usually creates friction instead of quality. Better decisions beat bigger software collections.