Just published, the Last Name API is capable to infer the origin of a given last name. For example, Fonzarelli is a surname of Italian origin with a confidence score of 99.41 %.
The API is built using the excellent framework, ML.Net. It uses a training dataset of 20 000 surnames. But, there is always room for improvement. If you have any surnames of any origin that you can share, please send them to me, to update the training dataset. More surname, better accuracy. At this time, the dataset have data in 18 categories , including Arabic, Chinese, Czech, Dutch, English, French, German, Greek, Irish, Italian, Japanese, Korean, Polish, Portuguese, Russian, Scottish, Spanish, Vietnamese. If you have a list of Indian, Romanian or any other surnames, let me know.
Your feedback is more than welcome.
Image by Holly Mindrup