microwatt

Commit Graph

Author	SHA1	Message	Date
Anton Blanchard	71d4b5ed20	core_debug: Initialise gspr_index Another case of U state being driven out of a module. Signed-off-by: Anton Blanchard <anton@linux.ibm.com>	4 years ago
Anton Blanchard	1ff852b012	Merge pull request #369 from antonblanchard/loadstore-pmu-init loadstore1: Initialise PMU events	4 years ago
Anton Blanchard	e2438071a1	loadstore1: Initialise PMU events The loadstore1 PMU events are U state until a load and a store completes. Signed-off-by: Anton Blanchard <anton@linux.ibm.com>	4 years ago
Anton Blanchard	b7c4d3c5c3	Merge pull request #367 from antonblanchard/fpu-typo fpu: Fix capitalisation of Execute1ToFPUType	4 years ago
Anton Blanchard	64d2def0c6	fpu: Fix capitalisation of Execute1ToFPUType While this is not an issue in VHDL, I noticed this when running a script over the source and we may as well fix it. Signed-off-by: Anton Blanchard <anton@linux.ibm.com>	4 years ago
Anton Blanchard	b8fc5636a4	Merge pull request #365 from antonblanchard/less-fpga-init Remove some FPGA style signal inits	4 years ago
Anton Blanchard	ebdddcc402	Remove some FPGA style signal inits These don't work on the ASIC flow, so remove them and initialise them explicitly where required. Signed-off-by: Anton Blanchard <anton@linux.ibm.com>	4 years ago
Anton Blanchard	a750365ffa	Remove some FPGA style signal inits These don't work on the ASIC flow, so remove them and initialise them explicitly where required. Signed-off-by: Anton Blanchard <anton@linux.ibm.com>	4 years ago
Michael Neuling	f5e06c2d4b	Merge pull request #361 from antonblanchard/alt-reset-address Allow ALT_RESET_ADDRESS to be overridden	4 years ago
Anton Blanchard	948f6f43a7	Allow ALT_RESET_ADDRESS to be overridden This allows us to boot from flash for example. Signed-off-by: Anton Blanchard <anton@linux.ibm.com>	4 years ago
Michael Neuling	8bf48ac094	Merge pull request #360 from antonblanchard/log2ceil-issue wishbone_bram_wrapper ram_addr_bits is 1 bit off	4 years ago
Anton Blanchard	b5accb78b2	wishbone_bram_wrapper ram_addr_bits is 1 bit off log2ceil() returns the number of bits required to store a value, so we need to pass in memory_size-1, not memory_size. Every other user of log2ceil() gets this right. Signed-off-by: Anton Blanchard <anton@linux.ibm.com>	4 years ago
Michael Neuling	30fd936c12	Merge pull request #358 from antonblanchard/unused-sig Remove unused sequential signal from Fetch1ToIcacheType	4 years ago
Michael Neuling	af1b76d944	Merge pull request #356 from antonblanchard/fpu-constant fpu: Make inverse_table a constant	4 years ago
Michael Neuling	9b96ab730c	Merge pull request #357 from antonblanchard/xics-warning xics: Fix warning when comparing two std_ulogic_vectors	4 years ago
Anton Blanchard	0b39947f8d	Remove unused sequential signal from Fetch1ToIcacheType GHDL synthesis is flagging a warning about this. Signed-off-by: Anton Blanchard <anton@linux.ibm.com>	4 years ago
Anton Blanchard	00bf0af21c	xics: Fix warning when comparing two std_ulogic_vectors Use unsigned() to make it clear what we are doing. Signed-off-by: Anton Blanchard <anton@linux.ibm.com>	4 years ago
Anton Blanchard	50b4cb9423	fpu: Make inverse_table a constant GHDL synthesis is complaining that inverse_table is never stored to. Change it to a constant. Signed-off-by: Anton Blanchard <anton@linux.ibm.com>	4 years ago
Michael Neuling	f01f3d233a	Merge pull request #352 from mkj/static-urjtag mw_debug: Add STATIC_URJTAG flag	4 years ago
Matt Johnston	c0c00d05bc	mw_debug: Add STATIC_URJTAG flag Revert to linking dynamically by default, can statically link with `make STATIC_URJTAG=1` Fixes #351 Signed-off-by: Matt Johnston <matt@codeconstruct.com.au>	4 years ago
Michael Neuling	ffcdaaa92d	Update the README Issues (#350) We've had these for a while now: - D/I cache - GPR bypassing - Supervisor state (and can boot linux) We still need Vector/VMX/VSX (and probably some other things) Signed-off-by: Michael Neuling <mikey@neuling.org>	4 years ago
Michael Neuling	b4770197a2	Merge pull request #349 from madscientist159/master Extend LiteDRAM VHDL wrapper to allow more than one clock line	4 years ago
Raptor Engineering Development Team	fcb783a0fb	Extend LiteDRAM VHDL wrapper to allow more than one clock line This is necessary for the upcoming Arctic Tern system enablement, since Arctic Tern uses two DRAM devices and a separate clock line is routed to each device. LiteX handles this behavior correctly, therefore we assume other hardware exists that uses a similar DRAM clock design. Updates from Mikey to fix some compile issues. Signed-off-by: Timothy Pearson <tpearson@raptorengineering.com> Signed-off-by: Michael Neuling <mikey@neuling.org>	4 years ago
Michael Neuling	2b97fb0bf3	Merge pull request #348 from paulusmack/reduce Reduce LUT usage	4 years ago
Paul Mackerras	0aa898c7a6	xics: Rework the irq_gen process At present, the loop in the irq_gen process generates a chain of comparators and other logic to work out the source number and priority of the most-favoured (lowest priority number) pending interrupt. This replaces that chain with (1) logic to generate an array of bits, one per priority, indicating whether any interrupt is pending at that priority, (2) a priority encoder to select the most favoured priority with an interrupt pending, (3) logic to generate an array of bits, one per source, indicating whether an interrupt is pending at the priority calculated in step 2, and (4) a priority encoder to work out the lowest numbered source that has an interrupt pending at the selected priority. This reduces LUT utilization. The priority encoder function implemented here uses the optimized count-leading-zeroes logic from helpers.vhdl. Signed-off-by: Paul Mackerras <paulus@ozlabs.org>	4 years ago
Paul Mackerras	1720a0584a	Use alternative count-leading-zeroes algorithm in the FPU and LSU Signed-off-by: Paul Mackerras <paulus@ozlabs.org>	4 years ago
Paul Mackerras	1086988883	countzero: Use alternative algorithm for higher bits This implements an alternative count-leading-zeroes algorithm which uses less LUTs to generate the higher-order bits (2..5) of the result. By doing (v | -v) rather than (v & -v), we get a value which has ones from the MSB down to the rightmost 1 bit in v and then zeroes down to the LSB. This means that we can generate the MSB of the result (the index of the rightmost 1 bit in v) just by looking at bits 63 and 31 of (v | -v), assuming that v is 64 bits. Bit 4 of the result requires looking at bits 63, 47, 31 and 15. In contrast, each bit of the result using (v & -v), which has a single 1, requires ORing together 32 bits. It turns out that the minimum LUT usage comes from using (v & -v) to generate bits 0 and 1 of the result, and using (v | -v) to generate bits 2 to 5. This saves almost 60 6-input LUTs on the Artix-7. Signed-off-by: Paul Mackerras <paulus@ozlabs.org>	4 years ago
Paul Mackerras	4cf2921b0b	soc: Re-do peripheral address decode to improve timing This generates a series of io_cycle_* signals which are clean latches and which become the 'cyc' signals of the wishbone buses going to various peripherals (syscon, uarts, XICS, GPIO, etc.). Effectively this is done by moving the address decoding into the slave_io_latch process. The slave_io_type, which drives the multiplexer which selects which wishbone to look for a response on, is reduced to just 8 values in the expectation that an 8-way multiplexer will use less logic than one with more than 8 inputs. With this timing is considerably better on the A7-100T. Signed-off-by: Paul Mackerras <paulus@ozlabs.org>	4 years ago
Michael Neuling	27b660ef76	Merge pull request #346 from mkj/dmi_ecp5 Add DMI and mw_debug for ECP5	4 years ago
Anton Blanchard	5a5a082601	Merge pull request #343 from mikey/orange-crab-ci ci: Add new Orange Crab build	4 years ago
Matt Johnston	9c64f8a98b	mw_debug: Add Lattice ECP5 support "-b ecp5" will select ECP5 interface that talks to a JTAGG primitive. For example with a FT232H JTAG board: ./mw_debug -t 'ft2232 vid=0x0403 pid=0x6014' -s 30000000 -b ecp5 mr ff003888 6 Connected to libftdi driver. Found device ID: 0x41113043 00000000ff003888: 6d6f636c65570a0a ..Welcom 00000000ff003890: 63694d206f742065 e to Mic 00000000ff003898: 2120747461776f72 rowatt ! 00000000ff0038a0: 0000000000000a0a ........ 00000000ff0038a8: 67697320636f5320 Soc sig 00000000ff0038b0: 203a65727574616e nature: Core: running NIA: c0000000000187f8 MSR: 9000000000001033 Signed-off-by: Matt Johnston <matt@codeconstruct.com.au>	4 years ago
Matt Johnston	3775650df3	dmi_dtm_ecp5: Use ECP5 JTAGG for DMI This uses the JTAGG primitive which is similar to BSCANE2. The LUT4 delay approach came from Florian and Greg in https://github.com/enjoy-digital/litex/pull/1087 Has been tested on an OrangeCrab with 48MHz sysclk FT232H up to 30MHz (though libusb/urjtag is by far the bottleneck vs the JTAG clock) Signed-off-by: Matt Johnston <matt@codeconstruct.com.au>	4 years ago
Matt Johnston	eb20195a10	mw_debug: Link urjtag statically liburjtag isn't in Debian, so usually we're pointing at a urjtag build directory when building mw_debug Signed-off-by: Matt Johnston <matt@codeconstruct.com.au>	4 years ago
Matt Johnston	763138798e	mw_debug: use isxdigit for hex arguments Signed-off-by: Matt Johnston <matt@codeconstruct.com.au>	4 years ago
Matt Johnston	04cc4a842c	mw_debug: Add -s frequency argument Chose -s for speed, vs -f for --force Signed-off-by: Matt Johnston <matt@codeconstruct.com.au>	4 years ago
Matt Johnston	e05ae0c8cb	mw_debug: pass target parameters to urjtag An example ./mw_debug -d -t 'ft2232 vid=0x0403 pid=0x6014' Signed-off-by: Matt Johnston <matt@codeconstruct.com.au>	4 years ago
Paul Mackerras	49ec80ac3e	fetch1/icache1: Remove the use_previous logic This removes logic that I added some time ago with the thought that it would enable us to do prefetching in the icache. This logic detects when the fetch address is an odd multiple of 4 and the next address in sequence from the previous cycle. In that case the instruction we want is in the output register of the icache RAM already so there is no need to do another read or any icache tag or TLB lookup. However, this logic adds complexity, and removing it improves timing, so this removes it. Signed-off-by: Paul Mackerras <paulus@ozlabs.org>	4 years ago
Paul Mackerras	cef3660e74	Merge pull request #345 from antonblanchard/popcnt-go-fast popcnt* timing improvements from Paul	4 years ago
Paul Mackerras	2491aa7fc5	core: Make popcnt* take two cycles This moves the calculation of the result for popcnt* into the countbits unit, renamed from countzero, so that we can take two cycles to get the result. The motivation for this is that the popcnt* calculation was showing up as a critical path. Signed-off-by: Paul Mackerras <paulus@ozlabs.org>	4 years ago
Michael Neuling	286757f0f7	ci: Add new Orange Crab build This builds the Orange Crab v0.21 + litedram image Signed-off-by: Michael Neuling <mikey@neuling.org>	4 years ago
Michael Neuling	6ff3b2499c	Merge pull request #342 from mkj/orangecrab-merge Orangecrab working with litedram Fixed up a few simple merge conflicts in the Makefile.	4 years ago
Michael Neuling	cdd661d844	Merge branch 'master' into orangecrab-merge	4 years ago
Michael Neuling	fda8879e2f	Merge pull request #341 from mkj/progtools orangecrab programming targets	4 years ago
Michael Neuling	ffbf2f9964	Merge pull request #340 from mkj/orangecrab-ghdl-plugin Makefile: detect when ghdl is a yosys plugin	4 years ago
Matt Johnston	049f0549d8	orangecrab: Fix sdcard wishbone addressing Orangecrab missed out on: Make wishbone addresses be in units of doublewords or words Author: Paul Mackerras <paulus@ozlabs.org> Date: Wed Sep 15 18:18:09 2021 +1000 Signed-off-by: Matt Johnston <matt@codeconstruct.com.au>	4 years ago
Matt Johnston	abc6a4f372	orangecrab: use litesdcard Currently not working (tested in Linux) Signed-off-by: Matt Johnston <matt@codeconstruct.com.au>	4 years ago
Matt Johnston	42959184dd	litesdcard: add lattice, regenerate Modifies litescard generate script to take a clock speed. Regenerated verilog with latest litesdcard e52c731 ("Bump year.") Signed-off-by: Matt Johnston <matt@codeconstruct.com.au>	4 years ago
Matt Johnston	d794cc70b1	orangecrab: No BTC, LOG_LENGTH, dram NUM_LINES Reduce litedram NUM_LINES 64->8 This allows us to meet timing. Can probably be improved in future with better BRAM usage. Signed-off-by: Matt Johnston <matt@codeconstruct.com.au>	4 years ago
Matt Johnston	a8d9203c5d	orangecrab: Use litedram Signed-off-by: Matt Johnston <matt@codeconstruct.com.au>	4 years ago
Matt Johnston	57d4c4c117	orangecrab: set HAS_SHORT_MULT It seems free, generated as a single MULT18X18D Signed-off-by: Matt Johnston <matt@codeconstruct.com.au>	4 years ago
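
The bit trick described in commit 1086988883 can be checked with a small software model. The following is only a sketch in Python, not the project's VHDL: the names `MASK64` and `trailing_zero_index` are hypothetical, and the loops stand in for what would be fixed wiring in hardware. It assumes a nonzero 64-bit input and computes the index of the rightmost 1 bit, taking result bits 0 and 1 from the one-hot value `(v & -v)` and bits 2 to 5 from the filled value `(v | -v)`, as the commit message describes.

```python
MASK64 = (1 << 64) - 1  # hypothetical helper: model 64-bit two's complement


def trailing_zero_index(v: int) -> int:
    """Index of the rightmost 1 bit of a nonzero 64-bit value."""
    assert 0 < v <= MASK64
    neg = (-v) & MASK64
    onehot = v & neg           # a single 1 at the rightmost set bit of v
    fill = (v | neg) & MASK64  # ones from bit 63 down to that bit, zeroes below

    result = 0

    # Bits 0 and 1: OR together the 32 one-hot positions whose index has bit i set.
    for i in (0, 1):
        select = sum(1 << j for j in range(64) if (j >> i) & 1)
        if onehot & select:
            result |= 1 << i

    # Bits 2..5: result bit i is set when, in some aligned block of 2**(i+1) bits,
    # the top bit of the block is 1 and the top bit of its lower half is 0.
    # For i = 5 this inspects bits 63 and 31; for i = 4, bits 63, 47, 31 and 15.
    for i in range(2, 6):
        block = 1 << (i + 1)
        for base in range(0, 64, block):
            top = base + block - 1
            mid = base + (1 << i) - 1
            if (fill >> top) & 1 and not (fill >> mid) & 1:
                result |= 1 << i

    return result
```

For example, `trailing_zero_index(0b101000)` evaluates to 3. The model only illustrates why the higher-order bits need so few inputs; it says nothing about the actual LUT mapping on the Artix-7.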


1100 Commits (71d4b5ed200cdf813320f68e9b4cca7e51e60eb1)
