
The discussion has mostly been emotional to date. Here are some statistics (not necessarily perfect, but pretty close)...
Total number of files (after removing the .git files):
  $ find . -type f | wc -l
  6055

Number of files that are identified as being copyrighted:
  $ grep -l -i -r 'Copyright' * | wc -l
  5173

List of files with copyrights:
  $ grep -l -i -r 'Copyright' * | sort > ~/ucopy.txt

List of all files (with the leading "./" stripped so the names match the grep output):
  $ find . -type f | sed 's|^\./||' | sort > ~/ufiles.txt

List of files *WITHOUT* the string "copyright" in them:
  $ comm -3 ~/ufiles.txt ~/ucopy.txt > ~/nocopyright.txt

Of the above, 130 files have the GPL in the header but not the string "copyright" (incomplete headers):
  $ for file in `cat ~/nocopyright.txt` ; do grep -il 'General Public License' $file ; done | wc -l
  130
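
The for loop above splits the file list on whitespace, so any path containing a space would be miscounted. A minimal sketch of a more defensive version follows; the function name count_gpl_headers is made up for illustration, and it assumes the list holds one path per line, relative to the current directory.

```shell
#!/bin/sh
# count_gpl_headers LISTFILE
# Hypothetical helper: counts the files named in LISTFILE (one path per
# line) that contain the string "General Public License" in any case.
count_gpl_headers() {
    count=0
    # IFS= and -r keep leading blanks and backslashes in the path intact.
    while IFS= read -r file; do
        # Skip entries that are not regular files (stale or blank lines).
        [ -f "$file" ] || continue
        if grep -qi 'General Public License' "$file"; then
            count=$((count + 1))
        fi
    done < "$1"
    echo "$count"
}
```

Usage would be something like "count_gpl_headers ~/nocopyright.txt", run from the top of the source tree.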
---------------------------------------------------------------------
Total number of files that are GPLv2:
  $ grep -i -r 'Free Software Foundation' * | grep -i 'version 2' > ~/ugplv2.txt
  $ wc -l ~/ugplv2.txt
  4588

Number of files that are GPLv2 *or later*:
  $ grep -i -r 'Free Software Foundation' * | grep -i 'either version 2' | wc -l
  4512

76 files are GPLv2 *ONLY*:
  $ grep -i -v 'either version 2' ~/ugplv2.txt | awk '{print $1}' | sed 's/:$//'
  board/stxgp3/ddr.c
  board/netstar/eeprom_start.S
  board/sbc8560/ddr.c
  board/mpc8540eval/ddr.c
  board/socrates/ddr.c
  board/pm856/ddr.c
  board/freescale/p2020ds/ddr.c
  board/freescale/mpc8541cds/ddr.c
  board/freescale/mpc8641hpcn/ddr.c
  board/freescale/mpc8555cds/ddr.c
  board/freescale/mpc8536ds/ddr.c
  board/freescale/mpc8568mds/ddr.c
  board/freescale/mpc8548cds/ddr.c
  board/freescale/mpc8610hpcd/ddr.c
  board/freescale/mpc8544ds/ddr.c
  board/freescale/mpc8560ads/ddr.c
  board/freescale/mpc8572ds/ddr.c
  board/freescale/mpc8569mds/ddr.c
  board/freescale/mpc8540ads/ddr.c
  board/atum8548/ddr.c
  board/stxssa/ddr.c
  board/xes/xpedite5200/ddr.c
  board/voiceblue/voiceblue.c
  board/voiceblue/eeprom_start.S
  board/voiceblue/setup.S
  board/pm854/ddr.c
  board/sbc8641d/ddr.c
  board/sbc8548/ddr.c
  common/ddr_spd.c
  cpu/mpc86xx/fdt.c
  cpu/mpc86xx/ddr-8641.c
  cpu/mpc85xx/ddr-gen3.c
  cpu/mpc85xx/ddr-gen2.c
  cpu/mpc85xx/ddr-gen1.c
  cpu/mpc8xxx/ddr/ctrl_regs.c
  cpu/mpc8xxx/ddr/Makefile
  cpu/mpc8xxx/ddr/ddr.h
  cpu/mpc8xxx/ddr/ddr2_dimm_params.c
  cpu/mpc8xxx/ddr/common_timing_params.h
  cpu/mpc8xxx/ddr/ddr1_dimm_params.c
  cpu/mpc8xxx/ddr/options.c
  cpu/mpc8xxx/ddr/main.c
  cpu/mpc8xxx/ddr/util.c
  cpu/mpc8xxx/ddr/ddr3_dimm_params.c
  cpu/mpc8xxx/ddr/lc_common_dimm_params.c
  drivers/gpio/pca953x.c
  drivers/pci/fsl_pci_init.c
  drivers/misc/ds4510.c
  drivers/mtd/nand/nand.c
  drivers/i2c/fsl_i2c.c
  drivers/usb/host/ehci-pci.c
  drivers/usb/host/r8a66597.h
  drivers/usb/host/ehci.h
  drivers/usb/host/r8a66597-hcd.c
  drivers/usb/host/ehci-core.h
  drivers/usb/host/ehci-hcd.c
  drivers/mmc/omap3_mmc.c
  include/asm-ppc/fsl_ddr_sdram.h
  include/asm-ppc/fsl_i2c.h
  include/asm-ppc/fsl_dma.h
  include/asm-ppc/mpc8xxx_spi.h
  include/asm-ppc/fsl_ddr_dimm_params.h
  include/pca953x.h
  include/ds4510.h
  include/configs/MPC8610HPCD.h
  include/configs/voiceblue.h
  include/spi_flash.h
  include/ddr_spd.h
  include/asm-m68k/fsl_i2c.h
  include/addr_map.h
  include/sha1.h
  include/nand.h
  include/asm-arm/arch-omap3/mmc.h
  include/asm-arm/arch-omap3/mmc_host_def.h
  lib_generic/sha1.c
  lib_generic/addr_map.c
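
Note that the GPLv2 counts above are counts of matching *lines*, not files, so a file with two matching lines would be counted twice. A sketch of a per-file count is below. The function name count_matching_files is hypothetical, and it assumes no path contains a colon and that binary-file notices from grep can be ignored.

```shell
#!/bin/sh
# count_matching_files PATTERN1 PATTERN2 DIR
# Hypothetical helper: counts distinct files under DIR that have at least
# one line matching both patterns (case-insensitive).
count_matching_files() {
    grep -i -r "$1" "$3" |   # lines matching the first pattern, prefixed "path:"
        grep -i "$2" |       # keep only lines that also match the second pattern
        cut -d: -f1 |        # keep the path part (assumes no ':' in paths)
        sort -u |            # collapse repeated paths to one entry per file
        wc -l | tr -d ' '    # count, stripping the padding some wc versions add
}
```

For example, "count_matching_files 'Free Software Foundation' 'version 2' ." would give a file count to compare against the 4588 line count.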
Number of files that are BSD licensed (but the seven (7) libfdt files are dual-licensed "GPLv2 or later" / BSD):
  $ grep -r 'EXPRESS OR IMPLIED WARRANTIES' * | wc -l
  156

Number of doc/* files (most have no copyright statement):
  $ find doc/ -type f | wc -l
  147

Number of doc/* files that *do* have a copyright statement:
  $ grep -il copyright doc/* | wc -l
  15
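
One caveat on the two doc/ counts: the find is recursive, but "grep -il copyright doc/*" only looks at files directly in doc/, so files in subdirectories are counted in the total but never checked. A sketch that checks the same set of files that find counts is below; the function name count_copyrighted is made up for illustration.

```shell
#!/bin/sh
# count_copyrighted DIR
# Hypothetical helper: counts regular files anywhere under DIR that contain
# the string "copyright" (case-insensitive).
count_copyrighted() {
    # find hands the file names to grep in batches; grep -l prints each
    # matching file name once, and wc -l counts those names.
    find "$1" -type f -exec grep -il 'copyright' {} + | wc -l | tr -d ' '
}
```

Running "count_copyrighted doc/" would show whether the 15 above undercounts.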
This implies...
  156 - 7 = 149 files use the BSD license (7 dual licensed)
  5173 - 4588 - 149 = 436 files have license header problems or a different license?
  6055 - 5173 = 882 files don't have a copyright statement in them.
  147 - 15 = 132 doc/* files have no copyright
  882 - 132 = 750 files are not doc/* files and don't have copyright
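
For anyone who wants to redo the arithmetic after re-running the commands on a newer tree, here is a small sketch. The variable names are illustrative only, and the values are the counts reported above.

```shell
#!/bin/sh
# Counts copied from the command output earlier in this message.
total_files=6055        # find . -type f
copyright_files=5173    # files containing "Copyright"
gplv2_matches=4588      # "Free Software Foundation" + "version 2"
bsd_matches=156         # "EXPRESS OR IMPLIED WARRANTIES"
dual_licensed=7         # libfdt files, GPLv2-or-later / BSD
doc_files=147           # files under doc/
doc_copyright=15        # doc/* files with a copyright statement

# Derived figures, in the same order as the list above.
bsd_only=$((bsd_matches - dual_licensed))
unclear_license=$((copyright_files - gplv2_matches - bsd_only))
no_copyright=$((total_files - copyright_files))
doc_no_copyright=$((doc_files - doc_copyright))
non_doc_no_copyright=$((no_copyright - doc_no_copyright))

printf 'BSD only:                     %d\n' "$bsd_only"
printf 'Unclear or other license:     %d\n' "$unclear_license"
printf 'No copyright statement:       %d\n' "$no_copyright"
printf 'doc/* without copyright:      %d\n' "$doc_no_copyright"
printf 'Non-doc without copyright:    %d\n' "$non_doc_no_copyright"
```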

Best regards,
gvb