[mirror-admin] missing and extraneous files at kernel.org

Chris Schanzle schanzle at nist.gov
Thu Aug 13 14:50:46 EDT 2009


On 08/10/2009 05:24 PM, J.H. wrote:
> Carlos Carvalho wrote:
>   
>> J.H. (warthog19 at eaglescrag.net) wrote on 8 August 2009 15:00:
>>  >We don't, and never have, included any of the index.html files from 
>>  >upstream, that will not be fixed,
>>
>> Why? That sounds arbitrary. If you're a tier-1 mirror you should
>> replicate upstream, not decide on its contents. You can only exclude
>> whole portions of the tree due to resource limitation. This is not the
>> case for a few small files.
>>     
>
> In fact it is not arbitrary, and completely intentional.  Upstream 
> distro providers have a large number of people who have access to their 
> repositories, and I have an automated sync process.  Explicitly dropping 
> index.html files means several things:
>
> 	1) When a user browses to a page on mirrors.kernel.org they are
> 	   guaranteed that what they are seeing was generated from the
> 	   file listing, not by a rogue index.html page.  This means the
> 	   users can implicitly trust what they are seeing.
>
> 	2) It means I'm not unintentionally hosting websites under the
> 	   mirrors that I either don't know about or don't want to be
> 	   hosting.  I've got a lot of server / bandwidth resources I'm
> 	   sure there's a botnet somewhere that would love to sneak
> 	   something on there, I'm just trying to make things more
> 	   obvious if/when something like that does happen.
>
> 	3) This also means that I don't propagate these things to
> 	   mirrors who are mirroring from me.
>   

Skin the cat another way?  Disable Apache's sending index.html (I think 
DirectoryIndex in your httpd.conf) for the mirror tree directories.

IMHO, I think you should mirror everything upstream provides.  Besides, 
John, who's more authoritative to the public than kernel.org?  :-)


--


More information about the Mirror-admin mailing list