您的位置: 专家智库 > >

国家自然科学基金(s60703046)

作品数:1 被引量:0H指数:0
发文基金:国家自然科学基金国家重点基础研究发展计划更多>>
相关领域:自动化与计算机技术更多>>

文献类型

  • 1篇中文期刊文章

领域

  • 1篇自动化与计算...

主题

  • 1篇BACKUP
  • 1篇DE
  • 1篇DUPLIC...
  • 1篇LOOKUP
  • 1篇FINGER...
  • 1篇POST-P...

传媒

  • 1篇Journa...

年份

  • 1篇2010
1 条 记 录,以下是 1-1
排序方式:
Scalable high performance de-duplication backup via hash join
2010年
Apart from high space efficiency,other demanding requirements for enterprise de-duplication backup are high performance,high scalability,and availability for large-scale distributed environments.The main challenge is reducing the significant disk input/output(I/O) overhead as a result of constantly accessing the disk to identify duplicate chunks.Existing inline de-duplication approaches mainly rely on duplicate locality to avoid disk bottleneck,thus suffering from degradation under poor duplicate locality workload.This paper presents Chunkfarm,a post-processing de-duplication backup system designed to improve capacity,throughput,and scalability for de-duplication.Chunkfarm performs de-duplication backup using the hash join algorithm,which turns the notoriously random and small disk I/Os of fingerprint lookups and updates into large sequential disk I/Os,hence achieving high write throughput not influenced by workload locality.More importantly,by decentralizing fingerprint lookup and update,Chunkfarm supports a cluster of servers to perform de-duplication backup in parallel;it hence is conducive to distributed implementation and thus applicable to large-scale and distributed storage systems.
Tian-ming YANG Dan FENG Zhong-ying NIU Ya-ping WAN
关键词:POST-PROCESSING
共1页<1>
聚类工具0