Merge pull request 'pull from docs dev branch' (#7) from hush/docs:dev into master

Reviewed-on: https://git.hush.is/jahway603/hush-docs/pulls/7
3 years ago · 0ec78d8a44
2 changed files with 215 additions and 0 deletions
--- a/advanced/devops.md
+++ b/advanced/devops.md
@ -0,0 +1,83 @@
+# Hush DevOps
+
+This documentation is for services that run a Hush full node, such as mining pools, exchanges
+and lite wallet node operators. It covers advanced things to know which can help reduce downtime.
+
+## Minimize Rescan Downtime
+
+Sometimes a Hush node must rescan the history of the blockchain, to look for incoming and outgoing
+transactions. It normally happens when importing a new private key, but can also happen when some
+type of crash happens. For instance:
+
+  * The physical server or VM runs out of memory
+  * There is a power outage
+  * A bug in hushd itself causes a coredump and unclean exit
+  * kill -9 hushd is used
+
+If the above circumstances happen, hushd will not be able to do it's normal "bookkeeping" when
+doing a clean exit. Part of this work is updating metadata about shielded transactions, called "witnesses".
+When hushd starts and notices this data is out of date, it performs a *full rescan of all time* to update
+the data, which works, but it's very slow and will cause downtime of potentially hours or longer.
+
+To avoid this downtime, first make sure to be using at least Hush 3.8.0 which has two new features
+
+  * `-rescanheight` is an added option to `-rescan` that gives a specific height to rescan from instead of the genesis block
+  * `-keepnotewitnesscache` which means keep the partial ztx metadata and update it, instead of recreating it from scratch
+
+For example, lets say from the debug.log that a system admin knows that the last block height the node processed
+was 12345. This can be found by looking for the very last UpdateTip line in the debug.log. That means witness data is correct
+up to that height, and that is what we can use as a starting point. So the sysadmin would do
+
+    hushd -rescan=1 -rescanheight=12345 -keepnotewitnesscache=1
+
+and then hushd will only rescan the history from block 12345 until the current block, which saves huge amount of time.
+
+### What can go wrong?
+
+Lets say that the sysadmin uses the wrong `-rescanheight` on accident, what can go wrong? Is it possible to lose funds?
+No, it is not possible to lose access to funds with this method, especially if you have backups of the wallet.dat and private keys.
+If the sysadmin uses a block height further back in history, say 12300, that is perfectly fine. The node will do some extra
+work and take slightly longer. If the sysadmin uses a block height that is *too high*, such as 12500, it
+could cause hushd to be unaware of any transactions between block 12345 and 12500. If incoming transactions occurred, they will not
+be reflected in wallet balances and if outgoing transactions occured, wallet errors may happen when trying to spend shielded funds
+which hushd things are *unspent* but were actually spent between blocks 12345 and 12500.
+
+The fix is simple, the sysadmin can just stop and restart hushd with a `-rescanheight` further back in the past, such as block 12345
+or earlier. If the sysadmin does not know an exact block height to use for `-rescanheight` than a conservative estimate in the past
+can be used, such as a block height from one month or even one year ago. This will still help avoid much of blockchain history during
+the rescan and will still result in drastically less downtime while the node is rescanning.
+
+## My RPC interface stops responding to requests!
+
+It is sometimes possible for the RPC interface to stop answering requests, due to a bug inherited from BTC/ZEC and made worse
+by the privacy additions inside of HUSH. There are two different mitigations Hush has added to hushd to deal with this, while
+the underlying problem is researched more.
+
+### PLZ STOP
+
+If the RPC interface stops answering requests, it becomes impossible to run `hush-cli stop` and sysadmins are forced to use `kill`
+to stop the node, which can lead to problems described above which hushd crashing. So Hush developers made a purely disk-based method
+to ask hushd to stop. A sysadmin simply creates a file called `plz_stop` in the same directory as `wallet.dat`. Every 120 seconds, `hushd`
+looks for this file and will stop the node if it is found. The file can be of zero size, it's contents do not matter. So for instance:
+
+```
+cd ~/.hush/HUSH3 # or cd ~/.komodo/HUSH3 for legacy locations
+touch plz_stop
+# hushd will stop within 2 minutes
+```
+
+With the above "trick", you can avoid using `kill` and hushd will do it's ztx bookkeeping just before it shuts down, which avoids long rescans.
+
+
+### RPC Work Queue
+
+One way to avoid the "RPC deadlock bug" or make it much rarer is to increase the size of the "RPC work queue" in HUSH3.conf like this:
+
+```
+rpcworkqueue=8192
+```
+
+This makes it take much longer and less likely for all RPC "slots" to be deadlocked at the same time, which prevents issuing RPC commands to hushd.
+Larger numbers can be used, and in general, the more zaddrs and transactions a wallet has, the more likely it will run into a deadlock and need a higher
+`rpcworkqueue` value. This method uses a few more kilobytes of RAM to have more slots, which is a good trade to avoid downtime and node maintenance.
+
--- a/protocol/README.md
+++ b/protocol/README.md
@ -0,0 +1,132 @@
+# Hush Protocol Documentation
+
+This documentation covers low-level internals of how Hush Protocol is different
+from the Bitcoin Protocol and Zcash Protocol which Hush Protocol builds upon.
+This documentation is for Hush (v3) as opposed to the original Hush codebase, which
+was based on an older version of Zcash Protocol.
+
+This documentation assumes some familiarity with how Bitcoin Protocol works, but
+does not assume knowledge of Zcash Protocol or other privacy coins.
+
+# Hush Addresses
+
+Hush addresses work in essentially the exact same way as Bitcoin Protocol addresses,
+except that Hush has both transparent addresses (taddrs) and shielded addresses (zaddrs).
+Taddrs in Hush work exactly the same as their counterparts in Bitcoin Protocol. There is
+a public key which when hashed with Base58Check algorithm, gives the public address of
+that public key. Hush also has "compressed" or "uncompressed" public keys just like BTC.
+Hush taddrs are CASE-SENSITIVE, just like BTC.
+
+## Hush taddrs
+
+Hush Protocol forked from Bitcoin Protocol before Segwit existed, so it has no complications
+of "segwit-ish" things. Hush essentially has the Bitcoin 0.11.2 protocol frozen deep inside it,
+with small improvements and adjustments. Just like originally all BTC mainnet addresses started with a 1 or 3,
+Hush has taddrs that start with either R or b. This is related to HUSH having it's own base58 prefix.
+All Hush Smart Chains will have compatible addresses by using the same base58 prefix, which aids
+integration.
+
+## Hush zaddrs
+
+Hush zaddrs are Sapling Shielded addresses start with `zs1` in Bech32 format, which means they
+are CASE-INSENSITIVE. This makes zaddrs more efficient when stored in QR codes. Hush Smart Chains
+also will have addresses starting with `zs1`.
+
+# Hush Transactions
+
+Originally in 2016 Hush started as a fork of Zcash 1.x and had *optional privacy*. We decided that was a bad
+idea and transitioned to *enforced privacy* as of Block 340000 of the HUSH v3 chain in November 2020. So originally
+a transaction could have a taddr as an output (receiving funds) but that is no longer possible. We call this "z2z"
+because it means normal transactions must be from a zaddr and to a zaddr. Mining new funds and DPoW still use taddrs
+since that data must be public.
+
+When a new block is mined on HUSH, 90% of the funds go to the taddr of the miner and 10% goes to the address of
+the Founder Reward (or Developer Tax, however you like to think of it) which is `RHushEyeDm7XwtaTWtyCbjGQumYyV8vMjn` .
+
+Zcash has never and seemingly never will mandate zaddr usage, so z2z transactions are rare on their network. In addition
+to z2z, Hush has something called Sietch which obscures even more metadata about shielded transactions, specifically,
+the ordering of the outputs and change and the number of outputs.
+
+By default Zcash z2z transactions will have two outputs, where Hush will have eight on average. Individual transactions
+will have between seven and nine shielded outputs which is non-deterministically chosen at transaction creation time.
+
+# Hush Peer-to-Peer (P2P) layer
+
+Hush has mandatory TLS 1.3 encryption of all network connections, the first cryptocoin to accomplish this. What it means
+is that your ISP, network administrator and many low-level internet operators no longer can easily see exactly what you
+are doing, the details of your transactions and which IP addresses make them at which times is blinded to them, amongst
+lots of other metadata. Bitcoin, Zcash and Monero all broadcast all network data in plaintext, which leaks massive amounts
+of data to network eavesdroppers. Since everything is public already on Bitcoin, it makes sense that Satoshi did not use
+TLS. But for privacy coins, encrypted connections is a must to lessen metadata leakage.
+
+Mandatory TLS means that people recording the internet don't get to see all your data,
+and that is very good. To this date, all privacy coins known to the author use plaintext or optionally use TLS, no coin
+except Hush requires privacy for all network connections!
+
+What if the attacker is willing to run one or more nodes?  In both Bitcoin/Zcash protocol, it is simple to have one or many nodes on the network
+that listen for which IP address is first relaying a transaction, which is a very good indication of that node having created that transaction.
+Monero uses the (imperfect, but better than nothing) Dandelion Protocol method of improving network privacy, and Hush was inspired by that to
+make improvements against so-called "Sybil Attacks". 
+
+A "Sybil Attack" is a form of Denial-of-Service where the attacker is willing and able to run many nodes. It is often used to attack the Tor Network,
+where a large percentage of Tor Exit Nodes will become malicious and reduce the privacy of the network. Zcash and Monero both have privacy failures
+when a Sybil Attack is able to run a small percentage of the network, such as 10%. If a Sybil Attack would be able to run 50% or even 99% of the network,
+the privacy failures just get worse.
+
+The way the Hush p2p layer is designed, even if the Sybil Attacker is running 99% of the nodes on the network, they still would only have, less than a 50% chance of being able to tell which node created a transaction. This also means that under normal circumstances, it is impossible for a network eavesdropper or malicious node(s) on the network to identify which node created which transaction. While Bitcoin and Zcash Protocol are optimized to reduce bandwidth of the network, which makes it vulnerable to Sybil Attacks, Hush chooses to use more bandwidth and slightly more network latency to gain privacy.
+
+## P2P Details
+
+The number of maximum outbound peers in 8 in Bitcoin, 16 in Zcash and 64 with Hush. This necessarily means more network traffic and higher bandwidth usage
+in Hush versus Zcash or Bitcoin, which is increased even more by our use of TLS 1.3 instead of plaintext. Privacy is worth it to us, so we are happy to trade bandwidth
+for privacy.
+
+In Bitcoin and Zcash, when a node makes a brand new transaction, it relays it to all known outbound peers that it knows about. This is very good for resiliency,
+since it spreads the new transaction to many nodes at once. It is very bad for privacy, because of something called the "First Timestamp Estimator" from the paper
+"Deanonymization in the bitcoin P2P network" https://papers.nips.cc/paper/2017/file/6c3cf77d52820cd0fe646d38bc2145ca-Paper.pdf .
+
+
+Nodes can listen
+to P2P traffic, which is never written to the public "ledger" data viewable on an explorer, and estimate which node relayed a transaction first. This is very
+likely the node that created it, or a service that helped create the transaction, such as a lite wallet server. The more nodes you are willing to have listen,
+the more likely you are to guess correctly. If you have two nodes that say two different IP addresses made a transaction, the one with earlier timestamp is the
+better guess.
+
+It is trivial on Bitcoin and Zcash networks to run nodes to perform this "First Timestamp Estimator" and the Hush community assumes this already happens and is
+embedded into blockchain analysis softare already. This means that blockchain analysis companies can likely already tell the IP address of Bitcoin and Zcash
+transactions with high likelihood.
+
+## Defeating "First Timestamp Estimator" (FTE)
+
+
+Dandelion is one way to try to defeat it, but the internals implementation is complex and can potentially add new attacks, which is why it was never merged
+into Bitcoin. Monero does have Dandelion, but attacks such as BADCACA show it is still vulnerable to Sybil Attacks. Dandelion++ is an improvement to Dandelion
+that fixes many issues with the original Dandelion algorithm, but so far it seems no cryptocoin fully implements it, likely because of the complexity of how
+it changes network and mempool internals.
+
+Hush was motivated by the question "What is the simplest way to defeat the first timestamp estimator?". We also wanted to optimize for very bad conditions,
+of Sybil Attacks that run 50-99% of nodes, not small Sybil Attacks when the attacker runs less than half the nodes in the network. Implementing Dandelion
+requires changing how the mempool works and we also wanted to avoid that, since it's very hard to know if new vulnerabilities are being introduced.
+
+Hush's method to defeat the FTE is choosing a *random subset of peers* at transaction relay time, and only relaying the transaction to those peers.
+This means that it takes longer for Hush transactions to fully propagate across the network. Hush is happy to trade some network latency for privacy and since this
+latency is on the order of seconds, it does not slow down anything noticeable to end users. Currently Hush relays transactions to half of it's outbound nodes, and rounds
+down in case of odd numbers. So for instance, if a Hush node has 5 outbound connections, it will relay to a random subset of 2 of them, which is 40%. Rounding down
+helps make things harder on the attacker for nodes with small numbers of outbound connections, such as those who have only recently joined the network.
+
+Imagine a scenario where the network is 50% Sybil nodes, which are using the FTE to correlate which nodes created which transactions, i.e. creating 
+a mapping between IP address and transaction id. With the Hush transaction relay algorithm, it's possible for a Sybil node to be directly connected to the node
+making a transaction, yet that node might not relay the transaction to the Sybil node, since that outbound node is not one of the randomly 50% of peers that is chosen
+when the transaction is made.
+
+If we assume that 50% of our nodes are Sybil nodes and we only relay our transaction to 50% of our connections, then on average, the FTE will only be correct
+25% of the time. As we assume the Sybil Attacker has a larger percentage of network nodes, the FTE will increase in accuracy but never be more than 50% accurate, which is
+not very good at all.
+
+Nodes with small numbers of connections also do well against the attacker. If a node has only 3 outbound connections and makes a transaction, it will only relay it to 1
+node, which is 33%. If we assume a Sybil Attacker with 50% of network nodes, the FTE will only be accurate about 17% of the time in trying to decide which node made the
+transaction. In the same situation, a Hush node with 15 connections will result in an FTE with ~23% accuracy.
+
+## Tor P2P Support
+
+Tor v2 was deprecated by the Tor network and they have changed to v3, and Hush is currently in the process of merging Tor v3 support from BTC Core.