loopza are these kind of features available for other self-hosted software? I can't really see how you could block bots unless they use a specific user agent, or if there exists a PHP library to detect requests made by specific clients.
As for using your content, you set your own terms of use for your own forum. If companies steal your data, you'll have to seek legal advice.