SurMo is a new dynamic human rendering paradigm that jointly models temporal motion dynamics and human appearance in a single framework to achieve high-fidelity rendering. It encodes human motion efficiently with a surface-based triplane representation, and uses a physical motion decoding module and a 4D appearance decoding module to synthesize time-varying appearance effects such as clothing wrinkles and motion-dependent shadows. Compared with existing methods, SurMo shows significant improvements in both quantitative metrics and qualitative rendering results.
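
To make the idea of a surface-based triplane lookup more concrete, here is a minimal sketch in plain Python. It assumes a query point is described by surface coordinates (u, v) plus an offset h from the body surface, all normalized to [0, 1], and that the features sampled from the three planes are concatenated. The names `bilinear` and `sample_triplane`, the plane layout, and the choice of concatenation are illustrative assumptions, not SurMo's actual implementation.

```python
from typing import List

# Hypothetical layout: a plane is a grid of feature vectors, [rows][cols][channels].
Plane = List[List[List[float]]]


def bilinear(plane: Plane, x: float, y: float) -> List[float]:
    """Bilinearly sample a plane at normalized coords; x picks the column, y the row."""
    rows, cols = len(plane), len(plane[0])
    # Clamp to [0, 1] and scale to continuous grid indices.
    fx = min(max(x, 0.0), 1.0) * (cols - 1)
    fy = min(max(y, 0.0), 1.0) * (rows - 1)
    x0, y0 = int(fx), int(fy)
    x1, y1 = min(x0 + 1, cols - 1), min(y0 + 1, rows - 1)
    tx, ty = fx - x0, fy - y0
    out = []
    for c in range(len(plane[0][0])):
        top = plane[y0][x0][c] * (1 - tx) + plane[y0][x1][c] * tx
        bottom = plane[y1][x0][c] * (1 - tx) + plane[y1][x1][c] * tx
        out.append(top * (1 - ty) + bottom * ty)
    return out


def sample_triplane(plane_uv: Plane, plane_uh: Plane, plane_vh: Plane,
                    u: float, v: float, h: float) -> List[float]:
    """Project (u, v, h) onto three 2D planes and concatenate the sampled features.

    Concatenation is an assumption made for this sketch; summation is another
    common way to aggregate triplane features.
    """
    f_uv = bilinear(plane_uv, u, v)
    f_uh = bilinear(plane_uh, u, h)
    f_vh = bilinear(plane_vh, v, h)
    return f_uv + f_uh + f_vh  # list concatenation, not element-wise addition


# Toy usage: three 2x2 planes with a single feature channel each.
toy_plane = [[[0.0], [1.0]], [[2.0], [3.0]]]
feature = sample_triplane(toy_plane, toy_plane, toy_plane, 0.5, 0.5, 0.5)
```

The appeal of this kind of layout is that three 2D feature grids stand in for a dense 3D volume, so storage grows with the square of the resolution rather than the cube. In a full pipeline the sampled feature would be passed on to the decoding modules, which are not sketched here.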