Doctoral Speaking Skills Talk - Sanjith Athlur

May 30, 2025, 12:00pm - 1:00pm
Location: In Person - Newell-Simon 4305
Speaker: SANJITH ATHLUR, Ph.D. Student, Computer Science Department, Carnegie Mellon University
https://csd.cmu.edu/people/doctoral-student/sanjith-athlur

Okapi: Decoupling Data Striping and Redundancy Grouping in Cluster File Systems

The Okapi cluster file system decouples how data is spread across disks (data striping) for IO efficiency from how data is erasure coded together (redundancy grouping) for durability. Existing systems couple the configurations of these two mechanisms, inducing significant inefficiencies. Decoupling allows grouping to be configured based on reliability and space-efficiency goals, while simultaneously allowing striping to be configured based on performance goals. Decoupling also allows transitions from one erasure coding (EC) scheme to another (e.g., to react to changes in data temperature or disk failure rates) to occur without rewriting data.

Evaluation of an Okapi prototype shows that decoupling can be accomplished with a <1% increase in metadata size and file manager memory, and a minimal increase in the resources used for file creation and degraded reads. Overall, experiments demonstrate that decoupling can improve read throughput by 80% and reduce seeks per second by up to 70% without sacrificing any data reliability, and can reduce the overhead of redundancy transitions by ~50%.

Presented in Partial Fulfillment of the CSD Speaking Skills Requirement