Open source data integration tools may sometime lack in features when compared to their proprietary cousins such as Informatica, DataStage. However, depending on the tool, they do have a big and strong user community that develops new functionality all the time.
Here are some questions you need to ask yourself before making the decision of switching to open source data integration.
- How fast is your team moving?
- What development methodology are you using?
- Are people motivated in your team to learn bleeding edge software?
- What is your budget?
- How complicated is the data?
- Does the tool do stuff you wan out of the box?
- Can the tool easily integrate with your current scheduling system and data warehouse maintenance/data load teams?
- What is the latency requirement (speed of the updates)?
- Does the tool provide ability to be called as web service?
- How well does the tool implement with your existing technology ecosystem (i.e. if the tool is java based it can utilize your enterprise jars if you are a java shop)
- Does it allow hand-coding modules?
Open source tools such as Talend have plenty of resources available online and generally have low cost of procurement and faster implementation times. I've been using them for a while and really like the ease of working with them. But depending on your speed requirements you may or may not want to use them.