Web corpus of Samoan, created by Bharat Ram Ambati using the Corpus Factory tools and then pruned semi-automatically in collaboration with Galumalemana Alfred Hunkin. The original size (before semi-automatic pruning) was 9 million; the final size is 3.5 million.