文章列表
 
您正在查看 "P2p Related" 分类下的文章

2007-09-22 11:28
经济学模型在分享和竞争问题上确实有一套较为合理的解决办法,它解决的就是:我分享给你了我得到什么好处?我能不能在竞争中取得更多的优势?最典型的就是博弈论(Game theory)了。下面这个说的是另外一个方面,可以说抓住了目前P2P下载的一些较为实际的问题,解决这些问题还是比较有意义的。

http://www.linux.com/feature/119250

New P2P network uses bandwidth as currency


The only real exchange between peers in a traditional peer-to-peer network is limited to the files being transferred. Tribler is a new P2P network that's introducing social networking concepts to facilitate better interactions between users. Using new algorithms and protocols, Tribler users will also be able to cash in on their generous uploads for faster downloads.

The Tribler software is primarily developed by the researchers in the Delft University of Technology and De Vrije Universiteit Amsterdam. Sven Seuken, a second-year computer science Ph.D. student at Harvard's School of Engineering and Applied Sciences, says, "This year we started a research cooperation between the Tribler group and the Harvard Economics and Computer Science research group to introduce economic methodologies into the Tribler software." Seuken's adviser Prof. David Parkes from Harvard and Dr. Johan Pouwelse from Delft have developed the concept of a "virtual bandwidth currency."

Seuken says a virtual currency system can overcome the shortcomings of the BitTorrent protocol to improve the efficiency of the system. "The classic BitTorrent protocol is limited to the tit-for-tat mechanism," says Seuken, "meaning a user normally has to upload a file piece when he wants to download another piece." The problem is aggravated when you consider that many Internet connections are asymmetric, with download and upload speeds that can differ by as much as 4:1. This, says Seuken, becomes a problem when a user is downloading a file with the tit-for-tat mechanism; he can never use his full download capacity, because his limited upload capacity doesn't allow him to engage in any more tit-for-tat trades.

Furthermore, this system offers no incentive for users to upload once they have finished downloading, so many users turn off their file sharing client when they are done. By introducing a virtual currency, Tribler hopes to alleviate this problem. Users can earn currency for uploading, and they can spend it at a later time for fast downloads. Every user records his individual trade interactions with other peers, and using the BarterCast algorithm, this information is forwarded to other peers in the system. "Over time, a decentralized trust graph emerges that allows all users of the system to trade with the virtual currency."

Tribler also uses the social networking concept of connecting users in virtual friend circles to help download files. When downloading files, a group of users that are connected to each other sharing the same file are part of a "swarm." When a user from this swarm asks his friends for help, they get the information about which BitTorrent swarm he is currently working with and which file pieces are missing. "They can then join the same swarm as their friend, by downloading the same file, and then forward the pieces they have downloaded to their friend 'for free,' effectively donating upload bandwidth to him," Seuken says.

Linux clients available

Tribler offers clients for Windows, Mac OS X, and Linux. There are two Tribler feature-identical clients available, depending on whether your Internet connection is symmetric or asymmetric. Ubuntu users get a binary package to install Tribler, while other Linux users have to install from the source. Tribler depends on Python-wxGTK2.6, Python-m2crypto, VLC media player, ffmpeg, and Tor. The Ubuntu client has a bug that prevents files found on the network from being displayed, but there's a fix available.

In addition to using it as a P2P client, you can use Tribler as a regular BitTorrent client. It can find torrents automatically, or you can download torrent files from torrent Web sites. The currency concept works irrespective of where the torrent files come from. However, the currency concept will only work between Tribler users or users with compatible clients using the BarterCast and Give-to-Get algorithms.

Tribler lacks any content filtering features. "Tribler is an open P2P system and users can spread what they want, with the same (lack) of privacy as in BitTorrent," Seuken says.

Ongoing research

In addition to making friends, Tribler has a few other features adopted from social networking concepts. The project has just started a top-ten uploader list that lets users know which users are contributing the most to the P2P system.

Seuken says the developers are working on many important research topics at the same time, and in every new software release, new features will be implemented. "The feature with the largest impact will be the fully decentralized trust system. On top of the BarterCast algorithm, we are working on a protocol that makes the virtual currency trading secure, thus preventing fraud and attacks. Once operational, the download speed in P2P file sharing systems could easily be twice or three times as fast, if all users adopt the Tribler client or use compatible protocols." The team is also working on integrating more social network technology in the next version of Tribler -- for example, a chatting function for Tribler users to communicate with each other.

The Tribler project takes an interesting approach, mixing social networking concepts with peer-powered download networks. Already there's a lot of content on the network. With a number of unique features available and more still under development, Tribler has all the ingredients to redefine P2P networks.

 
2007-04-12 13:30
http://arstechnica.com/news.ars/post/20070410-accelerated-p2p-by-similarity-searches.html

Similarity searches accelerate P2P downloads by 30-70 percent

By John Timmer | Published: April 10, 2007 - 03:17PM CT

P2P file sharing offers the possibility of dramatically increased download speeds by avoiding the problems associated with a single (overloaded) server. But even P2P downloads can bog down when files are first seeded or if few active users have a copy of the file. A research team with members at Carnegie Mellon, Purdue, and Intel thinks they've found a way around some of these limitations using what they call Similarity-Enhanced Transfer (SET), a technique they claim can speed up P2P downloads by anywhere from 30 to 70 percent. They'll be presenting their technique at the 4th Symposium on Networked Systems Design and Implementation tomorrow.

The "Similarity" portion of SET comes from the realization that many of the files being shared contain pieces of identical data. Examples include music files that differ only in terms of tags, movies or movie trailers that are dubbed in different languages, and updated versions of software. Like other P2P systems, SET divides large files into small segments. Once that process is complete, however, the SET software searches for similar files using a method called "handprinting," which is similar to the pattern matching techniques used to cluster search results or filter spam. Once similar files are identified, they are scanned for any individual chunks that are identical to pieces of the file being downloaded.

As a result, SET should greatly expand the available sources of any given file. In practice, it seemed to work pretty well. Using existing P2P networks, they were able to grab a 30MB movie trailer in only a third of the time, since their software was able to find other sources that shared about 50 percent similarity. The rate of an MP3 download shot up by over 70 percent.

We may be able to see SET appearing in clients and distribution services soon. The presentation will come with actual implementation code, and the team hopes to see others put it to use. "This is a technique that I would like people to steal," said David Andersen of Carnegie Mellon, "Developers should just take the idea and use it in their own systems."


http://arstechnica.com/news.ars/post/20070411-inside-the-set-p2p-system.html

How SET accelerates P2P filesharing

By John Timmer | Published: April 11, 2007 - 11:05PM CT

Yesterday, we reported on the announcement of the SET filesharing system. SET claims to accelerate a typical P2P download by anywhere from 30-70 percent by increasing the potential sources of files being downloaded. It works this magic by identifying files that were similar to the one being downloaded, and finding identical pieces within those similar files. Thanks to the folks who developed the system, we had a chance to read the paper they are presenting that described the system in detail.

The obvious first hurdle in any such system is presenting the files to be shared in a way that allows similarities to be detected. P2P filesharing systems already split files into chunks typically based on byte length, allowing clients to retrieve different chunks from different sources. This would mask similar but nonidentical files, however, as any change that altered the total bytes would throw off the register of the information in any chunks that followed it. SET works around this by using a process called Rabin fingerprinting.

Using this method, chunk boundaries are set based on a combination of size and the properties of the data itself. For example, chunk boundaries could be determined based on rules such as "16 Kb after the last boundary at the first place that four bits in a row are zeroes." Using this sort of method allows the chunks to share common boundaries even when intervening differences throw them off register.

Once a file is split in chunks, a hash value can then be calculated for each chunk, allowing identical chunks to be recognized. But sharing and comparing all the hashes for the chunks of all available files would be horribly inefficient, as it would require significant network traffic and computational resources. To avoid these problems, the authors devised a rapid comparison scheme they termed "handprinting" that involves a single, small data transfer.

Once the hashes are calculated, they sorted them lexicographically (similar to alphabetical sorting). The first 30 of the resulting hashes were defined as the "handprint" of the file. Those handprints can then be sent to the global filesharing lookup table, allowing new clients to rapidly access them. By comparing two handprints, a rough measure of the total similarities between two files can be determined. Only when the similarity of a handprint in two different files was above a certain threshold would the full list of chunk hashes be compared, and identical pieces of the files could then be downloaded.

There are three variables that need to be set arbitrarily in the SET system, which the authors chose to fill empirically. The first is chunk size: bigger chunks would make for more efficient transfers, but produce fewer similar chunks when differences are evenly distributed in the files. They chose a 16Kb chunk because that size captured the majority of possible similarities while only imposing a one percent overhead.

A second variable was the number of chunk hashes that go into a handprint. They calculated that using 28 of them would give them a greater than 90 percent chance of identifying any two files that were more than 10 percent similar, so they tested a handprint of 30 chunk hashes against a set of movie and MP3 files obtained from P2P networks. This successfully pulled out 99 percent of the similar files, which they considered to be a pretty good rate. The final issue was where to set the similarity threshold that determines whether a file is similar enough to perform a full comparison. Here, tests showed that a 10 percent similarity provided a good balance between overhead and the identification of similar files.

With these parameters in place, they tested their system on a number of simulated networks, and compared it to BitTorrent clients on the same networks. In general, the speedups from SET were most dramatic when the network performance was worst. Speedup was minimal when any server was on a fast network, because a single server could saturate the clients. Put both servers and clients on slow, asymmetric DSL links, and SET provided a huge boost. In the real world, these inefficient conditions appear to predominate: the authors cite studies showing that over 60 percent of P2P downloads are never completed, and the median transfer time for a 100MB file on Kazaa was over a day.

Given that data, It looks like the SET technique could have a big impact on future P2P services. The authors also noted that their technique is simply a method for rapidly identifying more sources of the file that's being downloaded. There's also research going on into speeding the data transfers themselves, and the two improvements could be combined to produce an aggregate speedup. Hopefully, the resulting software will cut down on the most inefficient aspect of the whole process: the downloads that are started but never completed.


 
2007-03-06 10:01
我不是做这个方向的,因此仅仅做一个记录,以防今后会用上。

http://peersim.sourceforge.net/

http://pdos.csail.mit.edu/p2psim/

http://www.p2parch.de/RealPeer/

http://www3.ietf.org/proceedings/06mar/slides/P2PRG-1.pdf

最后一个是IETF老大们出的纲领性文件,非常值得收藏。
 
 
   
 
 
文章分类
 
   
 
文章存档
 
     
 
最新文章评论
  

NS高手啊,多交流。。。
 

看来这是个牛人啊,有问题多指教。。。
 

这个咱们完全可以搞,不过工程量巨大!
 

不错啊!
 

对于下面注释的内容深有体会,哈哈。
   
帮助中心 | 空间客服 | 投诉中心 | 空间协议
©2012 Baidu