I work in industrial agriculture, and we're trying to merge data from several different hardware sensors located on different farms. Each sensor has its own internal database using simple IDs. Now that we're centralizing everything into one big data lake, we have thousands of records with the ID `1` or `2`. It's a total mess for our analytics. My boss asked if we should just prefix the IDs with the farm name, but that feels like a hack. Is there a more professional way to handle global uniqueness in large-scale data systems?
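
For concreteness, here is a minimal sketch of what the prefixing idea would look like. The farm names, the record fields, and the `prefix_ids` helper are all made up for illustration; our real schemas differ.

```python
# Hypothetical sample data: two farms whose sensors both start counting at 1.
farm_records = {
    "north_farm": [{"id": 1, "moisture": 0.31}, {"id": 2, "moisture": 0.28}],
    "south_farm": [{"id": 1, "moisture": 0.44}, {"id": 2, "moisture": 0.40}],
}


def prefix_ids(records_by_farm):
    """Merge per-farm records, replacing each local ID with 'farm:local_id'."""
    merged = []
    for farm, records in records_by_farm.items():
        for record in records:
            merged.append({
                **record,
                "id": f"{farm}:{record['id']}",  # composite, globally scoped ID
                "farm": farm,                    # keep the source farm
                "local_id": record["id"],        # keep the original sensor ID
            })
    return merged


merged = prefix_ids(farm_records)
global_ids = [r["id"] for r in merged]
```

In a toy case like this the collisions go away, but the IDs only stay unique as long as the farm names themselves are unique and never change, which is part of why it feels fragile to me.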