Matches all kinds of symbols, including composite symbols used in various programming languages.
This may be a good starting point, but you should probably replace it with a custom regex suited to your input.
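
As a rough illustration of what such a custom regex could look like, here is a minimal Python sketch. The operator list `COMPOSITES`, the pattern `SYMBOL_RE`, and the helper `find_symbols` are hypothetical names chosen for this example and are not part of the original pattern. The sketch assumes that "composite" means a multi-character operator and that any other character that is neither a word character nor whitespace counts as a symbol.

```python
import re

# Hypothetical list of multi-character operators; adjust it for your language.
COMPOSITES = ["===", "!==", "==", "!=", "<=", ">=", "->", "=>", "&&", "||", "++", "--"]

# Put longer operators first so "===" is not split into "==" and "=".
composite_part = "|".join(
    re.escape(op) for op in sorted(COMPOSITES, key=len, reverse=True)
)

# Try composite operators first, then fall back to any single character
# that is neither a word character nor whitespace.
SYMBOL_RE = re.compile(rf"(?:{composite_part})|[^\w\s]")


def find_symbols(text):
    """Return all symbols found in text, in order of appearance."""
    return SYMBOL_RE.findall(text)


print(find_symbols("a === b && c != d;"))  # ['===', '&&', '!=', ';']
```

Note that `\w` includes the underscore, so `_` is treated as part of a word here rather than as a symbol. If your input needs different behavior, the character class is the place to change it.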