The correct and resilient operation of distributed systems-spanning global financial ledgers, decentralized autonomous organizations, and peer-to-peer energy microgrids-depends fundamentally on the correctness of their underlying consensus mechanisms. These protocols must guarantee agreement on shared state among a collection of potentially faulty or adversarial nodes, upholding the dual properties of safety and liveness even under hostile conditions. Despite the growing diversity of consensus algorithms, from classical crash-fault tolerant approaches such as Paxos and Raft to modern Byzantine fault-tolerant (BFT) variants and Directed Acyclic Graph (DAG)-based structures, the research community lacks a unified architecture for their systematic, cross-platform evaluation. This paper proposes a modular, extensible framework called the Consensus Evaluation and Resilience Framework (CERF) to fill this gap. CERF integrates five core components: a high-fidelity network emulation layer, a pluggable system-under-test (SUT) adapter, a fault injection engine (FIE) capable of simulating both benign and Byzantine failure modes, a multi-dimensional performance monitoring unit (PMU), and a formal consistency checker. By decoupling the testing infrastructure from protocol implementation, CERF enables fair comparisons of throughput, latency, energy efficiency, and scalability. The proposed architecture draws on insights from seminal benchmarking works including BlockBench, Jepsen, and ByzzBench, and extends evaluation criteria to address modern challenges in IoT environments, post-quantum security, and DAG-based ledgers. Graph-theoretical principles are incorporated at the topology design stage to model network vulnerabilities and guide adversarial scenario construction.